Abstract

We give a new $(1+\epsilon)$-approximation for the sparsest cut problem on graphs where small sets expand significantly more than the sparsest cut (sets of size $n/r$ expand by a factor $\sqrt{\log n \log r}$ bigger, for some small $r$; this condition holds for many natural graph families). We give two different algorithms. One involves Guruswami-Sinop rounding on the level-$r$ Lasserre relaxation. The other is combinatorial and involves a new notion called Small Set Expander Flows (inspired by the expander flows of [ARV09]) which we show exists in the input graph. Both algorithms run in time $2^{O(r)}\mathrm{poly}(n)$.

We also show similar approximation algorithms in graphs with genus $g$ with an analogous local expansion condition.

This is the first algorithm we know of that achieves $(1+\epsilon)$-approximation on such a general family of graphs.



1 Introduction

This paper concerns a new and promising analysis of Lasserre [Las02]/Parrilo [Par03] SDP relaxations for the (uniform) SPARSEST CUT problem, which gives $(1+\epsilon)$-approximation on several natural families of graphs. Note that Lasserre/Parrilo relaxations subsume all relaxations for the problem that were previously analysed: the spectral technique of Alon-Cheeger [AM85], the LP relaxation of Leighton-Rao [LR99] with approximation ratio $O(\log n)$, and the SDP with triangle inequality of Arora, Rao, Vazirani [ARV09] with approximation ratio $O(\sqrt{\log n})$. The approximation ratio of $O(\sqrt{\log n})$ has proven resistant to improvement in almost a decade (and there is some evidence the ratio may be tight for the ARV relaxation; see Lee-Sidiropoulos [LS11]). For a few families of graphs such as graphs of constant genus, an $O(1)$-approximation is known.

Recently, there has been increasing optimism among experts that Lasserre [Las02]/Parrilo [Par03] relaxations, which are actually a hierarchy of increasingly tighter relaxations whose $r$th level can be solved in $n^{O(r)}$ time, may provide better approximation algorithms for sparsest cut as well as other problems such as MAX CUT and unique games, and possibly even refute Khot's unique games conjecture. For instance Barak, Raghavendra, and Steurer [BRS11], relying on the earlier subexponential algorithm of Arora, Barak, Steurer [ABS10], showed that Lasserre relaxations can be used to design subexponential algorithms for the unique games problem. Independently, Guruswami and Sinop [GS11] gave another rounding that looks quite different but yielded very similar results. Subsequently, Barak et al. [BHK+12] showed that Lasserre relaxations can easily dispose of families of unique games instances that seemed "difficult" for simpler SDP relaxations: many families of instances can be solved near-optimally in 4-8 rounds! This result was subsequently extended by O'Donnell and Zhou [OZ13] to "difficult" families of graphs from [DKSV06] which are integrality gaps for uniform sparsest cut and balanced separator. Of course, it is unclear whether this demonstrates the power of these relaxations, or merely the limitations of our current lowerbound approaches. Nevertheless, the rise in researchers' hopes for better algorithms is palpable.

But the stumbling blocks in this quest are also quite clear. First, known ideas for analysing Lasserre relaxations generally require some condition on the $r$-th eigenvalue of the Laplacian for some small $r$, whereupon some $f(r,\epsilon)$ levels of Lasserre are shown to suffice for $(1+\epsilon)$-approximation. Unfortunately, many real-life graphs (e.g., even the 2D-grid) do not satisfy this eigenvalue condition, so new ideas seem needed.

Another stumbling block has been the inability to relate these new rounding algorithms for Lasserre relaxations to existing SDP rounding algorithms such as Goemans-Williamson and ARV. Since Lasserre relaxations greatly generalize normal SDP relaxations, one would like general purpose rounding algorithms which for small $r$ reduce to earlier rounding algorithms. A concrete question is: does the Guruswami-Sinop rounding algorithm always give an approximation ratio as good as the ARV $\sqrt{\log n}$ for sparsest cut once $r$ is sufficiently large? This is still unclear.

The current paper makes some progress on these stumbling blocks. We show that the GS
rounding algorithm achieves $(1+\epsilon)$-approximation for sparsest cut on an interesting family of
graphs that are not small set expanders and may not have large $r$th eigenvalue. If $\phi_{\mathrm{local}}$ denotes
the minimum sparsity of sets of size $n/r$, and $\phi_{\mathrm{sparsest}}$ the minimum sparsity among all sets,
then we require $\phi_{\mathrm{local}}/\phi_{\mathrm{sparsest}} \gg \sqrt{\log n \log r}$. Note that $\phi_{\mathrm{local}}$ is often larger than $\phi_{\mathrm{sparsest}}$ in
natural families of graphs. For example, in normalized $d$-dimensional $n^{1/d} \times \dots \times n^{1/d}$-grid graphs,
<P sparsest < ^^, 4>iocai > ^ in) whereas A,. <C 3 (^) . Note that when the condition is not 
met, a simple modification of our algorithm returns a subset of size $n/r$ that has sparsity $\sqrt{\log n \log r}$
times $\phi_{\mathrm{sparsest}}$. Thus setting $r = O(1)$ one recovers the ARV bound, though the analysis of this
case also uses ARV^1.
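
To make the gap between local and global sparsity concrete, here is a minimal brute-force sketch on a small cycle (a 1-dimensional grid). It assumes the normalization of sparsity as cut weight divided by $|S||V \setminus S|/n$, which is our reading of Section 2.1; the helper names `cycle`, `sparsity`, and `min_sparsity` are illustrative and do not come from the paper.

```python
from itertools import combinations

def cycle(n):
    """Cycle on n vertices; each edge has capacity 1/2 so every vertex has degree 1."""
    c = [[0.0] * n for _ in range(n)]
    for i in range(n):
        j = (i + 1) % n
        c[i][j] = c[j][i] = 0.5
    return c

def sparsity(c, S):
    """Cut weight of S divided by |S| * |V \\ S| / n (assumed normalization)."""
    n = len(c)
    inside = set(S)
    cut = sum(c[i][j] for i in inside for j in range(n) if j not in inside)
    return cut / (len(S) * (n - len(S)) / n)

def min_sparsity(c, max_size):
    """Brute-force minimum sparsity over all nonempty sets of size at most max_size."""
    n = len(c)
    return min(sparsity(c, S)
               for k in range(1, max_size + 1)
               for S in combinations(range(n), k))

n, r = 12, 4
c = cycle(n)
# Sparsity is symmetric under complement, so sets of size <= n/2 suffice.
phi_sparsest = min_sparsity(c, n // 2)
phi_local = min_sparsity(c, n // r)
print(phi_local / phi_sparsest)
```

On this toy instance the best small set is an arc of length $n/r$, and the best global cut is an arc of length $n/2$, so the ratio only reflects the shape of the definition; the paper's condition concerns much larger graphs.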

Comparison with existing work. As mentioned, earlier analyses of Lasserre relaxations require
a lowerbound on the $r$-th eigenvalue of the graph: the tightest such result from [GS13] requires
$\lambda_r \gg \phi_{\mathrm{sparsest}}$. Efforts to get around such limitations have focused on understanding the structure of
graphs which do not satisfy the eigenvalue condition: an example is the so-called high order Cheeger
inequality of [ABS10] (improved by Louis et al. [LRTV12] and Lee et al. [LGT12]) according to
which, roughly speaking, a graph with many eigenvalues close to $o(1)$ has a small nonexpanding
set. In other words, the graphs are not Small-Set Expanders^2. However, there is an inherent



^1 We also know how to achieve qualitatively similar results as our main result using BRS rounding + ARV ideas
applied to Lasserre solutions at the expense of stricter requirements on small set expansion. However, that method
seems unable to give better than $O(1)$-approximation, whereas GS rounding is able to give $(1+\epsilon)$.

^2 In fact, the unexpected appearance of Small Set Expansion (SSE) in this setting is believed to not be a fluke.
It appears in the SSE conjecture of Raghavendra and Steurer [RS10] (known to imply the UGC), their "Unique
games with SSE" conjecture, as well as in the known subexponential algorithms for unique games. Furthermore,
attempts to construct difficult examples for known SDP-based algorithms also end up using graphs (such as the noisy



Cheeger-like gap ($\phi$ vs $\sqrt{\phi}$) between eigenvalues and expansion that seems to limit the possible
improvements. Our algorithms work even without a bound on the $r$-th eigenvalue; they only need
bounds on expansion. (The $d$-dimensional grids are good examples.) Furthermore, they yield
$(1+\epsilon)$-approximation, which in the context of SPARSEST CUT seems quite surprising.

Subsequent to our work and inspired by it, Gharan and Trevisan [GT13] have shown how to
obtain factor $O(\sqrt{\log k})$ approximation from the basic ARV relaxation for the sparsest cut problem
under local expansion or spectral conditions.

Better algorithms for bounded genus graphs. Recall that for genus $g$ graphs there are
known $O(\log g)$-approximation algorithms for SPARSEST CUT [LS10]. We can show that GS'13
rounding gives a (1 + e)-approximation if (piocai > ^{-^^)4'sparsest- Thus for the 2D-grid, it implies 
that 0(l/e^) rounds of Lasserre yield a (1 + e)-approximation. Again, when the local expansion 
condition is not satisfied our algorithm finds a witnessing small set, allowing us to recover the 
existing $O(\log g)$ approximation for the general case.

Combinatorial algorithm. In addition to the above Lasserre-based algorithm, we also give a 
new combinatorial algorithm with similar (but somewhat weaker) guarantees. This algorithm is 
inspired by the primal-dual algorithms for sparsest cut stemming from the expander flows notion 
of ARV (see [AHK04, AK07, She09]). We introduce a new notion called small set expander flows: a 
multicommodity flow whose demand graph is an expander on small sets. Let a $(r, d, \beta)$-flow be an
undirected multicommodity flow in which $d$ units of flow are incident to each node, and the demand
graph has expansion $\beta$ on sets of size at most $n/r$ (in other words, the amount of flow leaving the
set S is d[i\S\). We show that in every graph there is an SSE flow with d = ^{(piocai V^og r/y/log n) , 
/3 = r2((logr)~^), and this flow — or something close to it — can be found in polynomial time. Using 
such flows one can — with some more work — compute a (1 + e)-approximation to sparsest cut as 
above. 

Note that the expander flow idea of ARV was motivated by the observation that expander flows
consist of a family of dual solutions to the SDP. We suspect that something analogous holds for
SSE flows and the Lasserre relaxation but are unable to prove this formally. However, we can
informally show a connection as follows: if a graph has a $(r, d, \beta)$-flow where

$$d\beta/\log r \gg \text{value of } O(r)\text{-rounds of Lasserre relaxation}$$

then the integrality gap of the Lasserre relaxation is at most $(1 + o(1))$. Thus the existence of
expander flows is another reason, besides the more direct rounding approach mentioned earlier,
why Lasserre relaxations are near-optimal when $\phi_{\mathrm{local}}/\phi_{\mathrm{global}} \gg \sqrt{\log n \log r}$.

Applications to semirandom models. Recently there has been interest in solving sparsest cut
on semirandom models of graphs [MMV12]. In these graphs one starts with a planted sparse cut 
in a random graph or expander, and then an adversary is allowed to change some edges. Our work 
provides a new algorithm for one such model, planted combinatorial expander on regular graphs. 
However our results for this model are not directly comparable to the ones in [MMV12]. Our result 
is presented in Appendix C. 

hypercube) which are small set expanders. 



2 Preliminaries and Background 

2.1 Expansion and Graph Laplacian 

Let $G = (V, E)$ be an undirected graph with edge capacities $c_e \ge 0$ for all $e \in E$. For simplicity we
assume that the input graph is regular with (normalized) degree 1, that is, $\sum_j c_{(i,j)} = 1$ for all vertices $i \in V$
(the results in Sections 3 and 4 can also be applied to irregular graphs). We always
use $n$ to denote the number of vertices in $G$.

The expansion of a set is defined as $\Phi(S) = \frac{E(S, V \setminus S)}{\min\{|S|, n - |S|\}}$, where $E(A, B) = \sum_{i \in A, j \in B} c_{(i,j)}$.

The sparsity of a set $\phi(S)$ is defined as $\frac{E(S, V \setminus S)}{\frac{1}{n}|S|\,|V \setminus S|}$. There are several problems related to sparsity
of cuts:

• The sparsest cut of the graph is a set $S$ that minimizes the sparsity $\phi(S) = \frac{E(S, V \setminus S)}{\frac{1}{n}|S|\,|V \setminus S|}$. We
use $\Phi_{\mathrm{sparsest}}$ to denote its expansion and $\phi_{\mathrm{sparsest}}$ to denote its sparsity.

• The edge expansion of a graph is a set $S$ that minimizes the expansion $\Phi(S)$. We use $\Phi_{\mathrm{global}}$
to denote its expansion. Notice that since we are working with regular graphs, this is also
equivalent to the graph conductance problem.

• The $c$-balanced separator of a graph is a set $S$ that minimizes the expansion $\Phi(S)$ among all
sets of size at least $cn$. We use $\Phi_{c\text{-balanced}}$ to denote its expansion.

While all these problems are closely related (for example, sparsest cut and edge expansion are 
equivalent up to a factor of 2), we carefully differentiate between them in this paper because we 
are looking for 1 + e approximation algorithms. 

We are also interested in the expansion of small sets: let $\Phi_r(G)$ be the smallest expansion of a
set of size at most $n/r$ and $\phi_r(G)$ be the smallest sparsity of a set of size at most $n/r$. Sometimes
when $r$ is fixed (or understood) we drop $r$ and use $\Phi_{\mathrm{local}}$ and $\phi_{\mathrm{local}}$ instead^3.

Notice that the requirement of our algorithms will have the form $\phi_{\mathrm{local}}/\phi_{\mathrm{global}} \gg f(n, r)$^4. Since
sparsity $\phi$ and expansion $\Phi$ are within a factor of 2 ($\Phi(S) \le \phi(S) \le 2\Phi(S)$), in such requirements
the ratios $\phi_{\mathrm{local}}/\phi_{\mathrm{global}}$ and $\Phi_{\mathrm{local}}/\Phi_{\mathrm{global}}$ can be interchanged.

The adjacency matrix $A$ of the graph $G$ is a matrix whose $(i, j)$-th entry is equal to $c_{(i,j)}$. If
$d_i = \sum_{(i,j) \in E} c_{(i,j)}$ denotes the degree of the $i$-th vertex, with $D$ being the diagonal matrix of degrees,
then the Laplacian of the graph $G$ is defined as $L = D - A$ (for a regular graph this is just $I - A$).
The normalized Laplacian of the graph is defined as $\mathcal{L} = D^{-1/2} L D^{-1/2}$.

Graph Laplacians are closely related to the expansion of sets. In particular, the Rayleigh
Quotient of a vector $x$, $R(x) = \frac{x^T L x}{x^T x}$, is exactly equal to the expansion of a set $S$ when $x$ is the
indicator vector of $S$ (and $S$ has size at most $n/2$).
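
The definitions above can be checked on a tiny instance. The following is a minimal sketch, assuming a graph given as a symmetric capacity matrix with normalized degree 1; the function names are ours, and the sparsity normalization $\frac{1}{n}|S||V \setminus S|$ is our reading of the (partly damaged) formula in the source.

```python
def cut_weight(c, S):
    """E(S, V \\ S): total capacity of edges with exactly one endpoint in S."""
    n = len(c)
    inside = set(S)
    return sum(c[i][j] for i in inside for j in range(n) if j not in inside)

def expansion(c, S):
    """Phi(S) = E(S, V \\ S) / min(|S|, n - |S|)."""
    n = len(c)
    return cut_weight(c, S) / min(len(S), n - len(S))

def sparsity(c, S):
    """phi(S) = E(S, V \\ S) / (|S| * |V \\ S| / n) (assumed normalization)."""
    n = len(c)
    return cut_weight(c, S) / (len(S) * (n - len(S)) / n)

def rayleigh_quotient(c, S):
    """x^T L x / x^T x for the 0/1 indicator x of S, where L = D - A."""
    n = len(c)
    inside = set(S)
    x = [1.0 if i in inside else 0.0 for i in range(n)]
    # x^T L x equals the sum over edges of c_ij * (x_i - x_j)^2.
    num = sum(c[i][j] * (x[i] - x[j]) ** 2
              for i in range(n) for j in range(i + 1, n))
    return num / sum(v * v for v in x)

def cycle(n):
    """Cycle with edge capacity 1/2, so every vertex has degree 1."""
    c = [[0.0] * n for _ in range(n)]
    for i in range(n):
        j = (i + 1) % n
        c[i][j] = c[j][i] = 0.5
    return c

c = cycle(8)
S = [0, 1, 2]
print(expansion(c, S), sparsity(c, S), rayleigh_quotient(c, S))
```

For a set of size at most $n/2$ the Rayleigh quotient of the indicator coincides with the expansion, and the sparsity lies between the expansion and twice the expansion.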

We will denote by $\phi_{\mathrm{SDP}}$ the optimum value of the Lasserre relaxation for sparsest cut. The
number of levels in the Lasserre hierarchy will be implicit in the context. 

2.2 Lasserre Relaxation and GS Rounding 

We will show sufficient conditions under which $r$ rounds of Lasserre Hierarchy relaxation can be
rounded to $(1+\epsilon)$-approximation for sparsest cut and related problems. In particular, we will
show that the particular rounding algorithm from [GS13] outputs such an approximation. (See Appendix A
for details on the Lasserre relaxation and the GS rounding algorithm.)

^3 $\Phi_{\mathrm{local}}$ and $\phi_{\mathrm{local}}$ usually denote the optimal expansion and sparsity of sets of size at most $O(n/r)$.
^4 $f \gg g$ means $f \ge Cg$ for some large universal constant $C$.

In general working with Lasserre relaxations involves tedious notation involving subsets of 
variables and assignments to them. Luckily all that has been handled in [GS13], leaving us to work 
with the relatively clean (standard) SDP notation. 

For the sake of simplicity, we will focus on the uniform sparsest cut problem on regular graphs.
Other variants, such as edge expansion, can easily be handled by changing the objective function.
Let $[x_u]_{u \in V}$ be the vectors corresponding to each node in $G$ obtained as a solution for $r$ rounds of
Lasserre Hierarchy relaxation. In particular, the $x_u$'s minimize the following ratio:

$$\phi_{\mathrm{SDP}} = \frac{\sum_{u<v} c_{uv}\|x_u - x_v\|^2}{\frac{1}{n}\sum_{u<v}\|x_u - x_v\|^2} \le \phi_{\mathrm{sparsest}}.$$
The denominator, whose value we will denote by $\nu$, can also be written as:

$$\nu = \frac{1}{n}\sum_{u<v}\|x_u - x_v\|^2 = \sum_u \Big\|x_u - \frac{1}{n}\sum_v x_v\Big\|^2.$$



We will shift each vector $x_u$ by the mean:

$$X_u = x_u - \frac{1}{n}\sum_v x_v \qquad (1)$$

so that $\sum_u X_u = 0$. Note:

$$\|X_u\|^2 \le 1.$$
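
The effect of the mean shift can be verified numerically. The sketch below is an illustration only: the vectors are arbitrary small examples rather than Lasserre solutions, and the function names are ours. It checks that the shifted vectors sum to zero and that $\frac{1}{n}\sum_{u<v}\|x_u - x_v\|^2$ equals the total squared norm of the shifted vectors.

```python
def shift_by_mean(xs):
    """Return X_u = x_u - (1/n) * sum_v x_v for every vector x_u."""
    n, dim = len(xs), len(xs[0])
    mean = [sum(x[k] for x in xs) / n for k in range(dim)]
    return [[x[k] - mean[k] for k in range(dim)] for x in xs]

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))

def pairwise_term(xs):
    """(1/n) * sum over u < v of ||x_u - x_v||^2."""
    n = len(xs)
    return sum(sq_dist(xs[u], xs[v])
               for u in range(n) for v in range(u + 1, n)) / n

def frobenius_sq(X):
    """Sum of squared norms of the vectors (squared Frobenius norm of the matrix)."""
    return sum(t * t for x in X for t in x)

xs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
X = shift_by_mean(xs)
print(pairwise_term(xs), frobenius_sq(X))
```

Because shifting all vectors by the same amount leaves every difference $x_u - x_v$ unchanged, the numerator of the SDP ratio is unaffected by the shift.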

We use $X = [X_u]$ to denote the matrix whose columns correspond to the vectors $X_u$. Since
$X_u - X_v = x_u - x_v$, $X \in \ell_2^2$ (i.e. columns of matrix $X$ satisfy the triangle inequality) and:

$$\sum_{u<v} c_{uv}\|X_u - X_v\|^2 = \sum_{u<v} c_{uv}\|x_u - x_v\|^2 \le \phi_{\mathrm{SDP}} \cdot \frac{1}{n}\sum_{u<v}\|X_u - X_v\|^2 = \phi_{\mathrm{SDP}}\|X\|_F^2,$$

where the last identity follows from the fact that $\sum_u X_u = 0$. Using $X$, we can re-state Theorem 3.1
from [GS13] in the following way:

Theorem 2.1 (Theorem 3.1 from [GS13]). If there exists a subset $S \in \binom{V}{r}$ with

$$\|X_S^\perp X\|_F^2 = \sum_u \|X_S^\perp X_u\|^2 \le \gamma\|X\|_F^2, \qquad (2)$$

then the rounding algorithm from [GS13] outputs a set $T$ such that:

$$\phi(T) \le \frac{\phi_{\mathrm{SDP}}}{1-\gamma}.$$

Here $X_S$ is the projection matrix onto the span of the submatrix indexed by $S$ and $X_S^\perp$ is the
projection matrix onto the orthogonal complement of $X_S$'s column span.

Furthermore, the SDP solver and rounding procedure can be implemented in time $2^{O(r)}\mathrm{poly}(n)$
using [GS12a].
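
The quantity in eq. (2) is easy to compute for a given candidate set. The following minimal sketch, with names of our own choosing, orthonormalizes the selected vectors by Gram-Schmidt and measures the fraction of squared norm left after projecting every vector onto the orthogonal complement of their span. It only evaluates the condition of Theorem 2.1; it is not the rounding algorithm of [GS13].

```python
from math import sqrt

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def orthonormal_basis(vectors, tol=1e-12):
    """Gram-Schmidt: orthonormal basis of the span of the given vectors."""
    basis = []
    for v in vectors:
        w = list(v)
        for b in basis:
            coef = dot(w, b)
            w = [wi - coef * bi for wi, bi in zip(w, b)]
        norm = sqrt(dot(w, w))
        if norm > tol:  # skip vectors already (numerically) in the span
            basis.append([wi / norm for wi in w])
    return basis

def residual_fraction(X, S):
    """||X_S^perp X||_F^2 / ||X||_F^2, with X a list of vectors and S a list of indices."""
    basis = orthonormal_basis([X[s] for s in S])
    residual = total = 0.0
    for x in X:
        r = list(x)
        for b in basis:
            coef = dot(r, b)
            r = [ri - coef * bi for ri, bi in zip(r, b)]
        residual += dot(r, r)
        total += dot(x, x)
    return residual / total

def sparsity_bound(phi_sdp, gamma):
    """Bound phi_SDP / (1 - gamma) from Theorem 2.1, valid when gamma < 1."""
    return phi_sdp / (1.0 - gamma)

X = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [1.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
gamma = residual_fraction(X, [0, 1])
print(gamma, sparsity_bound(1.0, gamma))
```

In the example, the first two vectors span the plane containing the third, so only the fourth vector contributes to the residual.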



3 Proof via orthogonal separators 

Theorem 2.1 implies that for $(1+\epsilon)$-approximation it suffices to show the existence of a small subset
$S$ of vertices such that the relative distance of all other vertices to the span of $X_S$ is smaller than
any small constant.

Theorem 3.1 (Main). For every $\epsilon > 0$ there is a constant $C = C(\epsilon)$ such that the following is true.
When all subsets of at most $2n/r$ vertices have sparsity $\phi_{\mathrm{local}} \ge C\phi_{\mathrm{SDP}}\sqrt{\log n \log r}$ in the graph,
there exists a set $S$ of $r$ vertices such that $\|X_S^\perp X\|_F^2 \le \epsilon\|X\|_F^2$. (Here $\phi_{\mathrm{SDP}} \le \phi_{\mathrm{sparsest}}$ is the value
of the Lasserre relaxation for $r + 3$ rounds and the $X$'s are the corresponding vectors from eq. (1).)

This existence result will be proven using the orthogonal separators of [CMM06] but with the
modifications of Bansal et al. [BFK+11], which, not surprisingly, were also developed in the context
of algorithms for small set expansion. (We know how to give a more direct proof without using
orthogonal separators but it brings in an additional factor of $\log r$ in the local expansion condition.)

Definition 3.2 (Orthogonal Separator). Let $X$ be an $\ell_2^2$ space. A distribution over subsets of
$X$ is called an $m$-orthogonal separator with distortion $D$, probability scale $\alpha > 0$ and separation
threshold $\beta < 1$ if the following conditions hold for $S \subseteq X$ chosen according to this distribution.

1. For all $X_u \in X$, $\Pr[X_u \in S] = \alpha\|X_u\|^2$.

2. For all $X_u, X_v \in X$ with $\|X_u - X_v\|^2 \ge \beta\min\{\|X_u\|^2, \|X_v\|^2\}$,

$$\Pr[X_u \in S \text{ and } X_v \in S] \le \frac{\min\{\Pr[X_u \in S], \Pr[X_v \in S]\}}{m}.$$

3. For all $X_u, X_v \in X$, $\Pr[I_S(X_u) \ne I_S(X_v)] \le \alpha D \cdot \|X_u - X_v\|^2$, where $I_S$ is the indicator
function of $S$.

Bansal et al. [BFK+11] showed the existence of such separators (in the process also giving an
efficient algorithm to construct them).

Lemma 3.3 ([BFK+11]). For all $\beta < 1$ there exists an $m$-orthogonal separator with distortion

$$D = O\left(\sqrt{\frac{\log|X| \log m}{\beta}}\right).$$



The dependency on $\beta$ follows from calculations in Lemma 4.9 in [CMM06]. From the explanation
of the above Lemma in [BFK+11], we know $\gamma = O(\sqrt{\beta})$, so the exponent in Lemma 4.9 in [CMM06] is
$1/(1 - \gamma^2) - 1 = O(\beta)$, and we want $(\log m'/m')^{O(\beta)}$ to be smaller than $1/m$. Setting $m' = m^{O(1/\beta)}$
suffices. Then the distortion is $O(\sqrt{\log|X| \log m'}) = O\left(\sqrt{\frac{\log|X| \log m}{\beta}}\right)$.

Now we show the following, which immediately implies Theorem 3.1. 

Theorem 3.4. For any $\delta > 0$, $0.25 > \beta > 0$, let $m = 10r^2/\delta$. Let $D$ denote the best distortion
possible for an $m$-orthogonal separator with separation $\beta$. If $X$ is any set of vectors in $\ell_2^2$, one
for each vertex in the graph, and the minimum expansion $\phi_{\mathrm{local}}$ among subsets of at most $2n/r$
vertices satisfies $\phi_{\mathrm{local}} \ge \Omega(\phi_{\mathrm{SDP}} D/\delta)$, then there exist $r$ points $S$ in $X$ such that $\|X_S^\perp X\|_F^2 \le O(\delta + \beta)\|X\|_F^2$.



The actual construction of orthogonal separators from [CMM06] requires the origin to be inside
the vector set. To achieve this, we will translate all vectors in the same direction:

Proposition 3.5. If Theorem 2.1 fails, then there exists a set of vectors $X \in \ell_2^2$ with $0 \in X$.

Proof. Given the vectors $[X_u]_u$ found by Theorem 2.1, we know that $\sum_u \|X_u\|^2 = \frac{1}{2n}\sum_u \sum_v \|X_u - X_v\|^2$.
Hence there exists some $t$ for which $\sum_u \|X_u - X_t\|^2 \le 2\sum_u \|X_u\|^2$. After having fixed such
$t$, we define our new vectors as $X'_u \leftarrow X_u - X_t$. It is easy to see that $X' \in \ell_2^2$, $0 \in X'$ and for every
subset of size $r - 1$, eq. (2) still fails (except $\epsilon$ becomes $\epsilon/2$, which only changes the constants in
$O$ notation). □

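We start by showing that most sets in the support of the orthogonal separator should be large.

Lemma 3.6. If $\phi_{\mathrm{local}} \ge 2\phi_{\mathrm{SDP}} D/\delta$ as in the hypothesis of Theorem 3.4, and $S$ is chosen according
to the orthogonal separator, then $\mathbb{E}[|S| \cdot I_{|S|<2n/r}] \le \delta\,\mathbb{E}[|S|]$, where $I_{|S|<2n/r}$ is the indicator variable
for the event "$|S| < 2n/r$."

Proof. On one hand we know

$$\mathbb{E}[\text{number of edges cut}] \ge \mathbb{E}[|S| \cdot I_{|S|<2n/r}] \cdot \Phi_{\mathrm{local}} \ge \mathbb{E}[|S| \cdot I_{|S|<2n/r}] \cdot \phi_{\mathrm{local}}/2.$$

On the other hand by condition 3 in the definition,

$$\mathbb{E}[\text{number of edges cut}] \le \alpha D \sum_{u<v} c_{uv}\|X_u - X_v\|^2.$$

Since $\sum_{u<v} c_{uv}\|X_u - X_v\|^2 \le \phi_{\mathrm{SDP}} \sum_u \|X_u\|^2 = \phi_{\mathrm{SDP}}\,\mathbb{E}[|S|]/\alpha$, we know

$$\mathbb{E}[|S| \cdot I_{|S|<2n/r}] \le \frac{2}{\phi_{\mathrm{local}}}\,\alpha D \sum_{u<v} c_{uv}\|X_u - X_v\|^2 \le \delta\,\mathbb{E}[|S|]. \qquad □$$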

Now we state a corollary but first we need this definition.

Definition 3.7 (volume). The volume of a subset $X' \subseteq X$ is

$$\mathrm{vol}(X') = \frac{\sum_{X_u \in X'} \|X_u\|^2}{\sum_{X_u \in X} \|X_u\|^2}.$$

Corollary 3.8. There exists a subset $X' \subseteq X$ with volume at least $1 - 2\delta$, such that the following is
true. Let $S'$ be a set picked probabilistically by first picking $S$ randomly according to the separator
and letting $S'$ be $S$ if $|S| \ge 2n/r$, and the empty set otherwise. Then we have:

1. For all $X_u \in X'$ we have $\Pr[X_u \in S'] \ge \alpha\|X_u\|^2/2$.

2. For all $X_u, X_v \in X$ with $\|X_u - X_v\|^2 \ge \beta\min\{\|X_u\|^2, \|X_v\|^2\}$,

$$\Pr[X_u \in S' \text{ and } X_v \in S'] \le \frac{\min\{\Pr[X_u \in S], \Pr[X_v \in S]\}}{m}.$$

Proof. (Sketch) The first condition is by Markov. The second condition holds because $S'$ is
always a subset of $S$, so the probability on the LHS only decreases. □

Now we are ready to prove Theorem 3.4.

Proof. (Theorem 3.4) We give a procedure that iteratively picks $r$ points such that most of
the volume in $X'$ lies close to them.

1. Initially none of the points are marked.

2. $i \leftarrow 1$.

3. While there is still a point in $X'$ that is not marked:

(a) Let $X_i$ be the point with largest norm among the unmarked points of $X'$.

(b) Pick a set $S_i$ ($|S_i| \ge 2n/r$) containing $X_i$, and containing at most $2n/m$ points that have
distance more than $\beta\|X_i\|^2$ from $X_i$. (Such a set exists as shown below.)

(c) Mark all points in $S_i$ as well as all points that have distance at most $2\beta\|X_i\|^2$ from $X_i$.
Denote by $M_i$ the set of points that were previously unmarked and got marked in this
step.

(d) Look over all $M_j$ for $j < i$ and if any points in them have distance at most $\beta\|X_i\|^2$ to
$X_i$ then add them to $M_i$ instead.

(e) $i \leftarrow i + 1$.

First we show using the probabilistic method why we can always perform step 3b. Pick a random
set $S'$ from the distribution of the separator, conditioning on its containing $X_i$. By the properties of
$S'$ we know that if $\|X_i - X_v\|^2 \ge \beta\|X_i\|^2$, then the conditional probability $\Pr[X_v \in S' \mid X_i \in S'] \le 2/m$.
So the expected number of points in $S'$ whose distance is at least $\beta\|X_i\|^2$ from $X_i$ is at most $2n/m$,
and in particular there must be one set that satisfies the condition.

Then we need to show that this process terminates in $r$ steps. To do so it suffices to show that
each $S_i$ has at least $n/r$ points that were not in any $S_j$ for $j < i$. We know that $|S_i| \ge 2n/r$. We
claim its intersection with any $S_j$ for $j < i$ is at most $4n/m$. The reason is that $X_i$ was unmarked
at the start of this phase, which implies that the balls of radius $\beta\|X_i\|^2$ and $\beta\|X_j\|^2$ around
$X_i$ and $X_j$ respectively must be disjoint (note that $\|X_j\| \ge \|X_i\|$) and thus the only intersections
among $S_i, S_j$ are from points outside these balls, which we know to be at most $4n/m$. Since
$4nr/m < n/r$ (recall $m = 10r^2/\delta$), we conclude that each $S_i$ introduces at least $n/r$ new
points, so the process must terminate in $r$ steps.

Finally we bound the average distance of the other points from this set $S = \{X_1, \dots, X_t\}$ ($t \le r$),
specifically, the quantity $\|X_S^\perp X\|_F^2/\|X\|_F^2$, by $O(\delta + \beta)$.

All points outside $X'$ (the set in Corollary 3.8) anyway have volume at most $2\delta$, so their
contribution is upperbounded by that. To bound the contribution of points in $X'$, consider how
the sets $M_1, \dots, M_t$ were picked. If $X_u$ is $\beta\|X_i\|^2$-close to $X_i$, then $X_u$ is in $M_i$ (these sets are disjoint
by the construction). Otherwise $X_u$ belongs to $M_i$ where $i$ is the time that $X_u$ gets marked.

All points in $M_i$ have squared norm at most $\|X_i\|^2$ since otherwise they would have been picked instead
of $X_i$. Also more than $n/r$ points (in fact, $2n/r - 2n/m > n/r$) in $M_i$ are $\beta\|X_i\|^2$-close to $X_i$, and
at most $2n/m$ points are $2\beta\|X_i\|^2$-far from $X_i$, so

$$\sum_{X_u \in M_i} \|X_u\|^2 \ge (|M_i| - 2n/m)(1 - 2\beta)\|X_i\|^2 \ge |M_i|\,\|X_i\|^2/3.$$

On the other hand, after projection to the space orthogonal to $X_S$, all but $2n/m$ points have squared norm smaller
than $2\beta\|X_i\|^2$, therefore after projection

$$\sum_{X_u \in M_i} \|X_S^\perp X_u\|^2 \le (|M_i| - 2n/m) \cdot 2\beta\|X_i\|^2 + 2n/m \cdot \|X_i\|^2 \le O(\beta + \delta)\,|M_i|\,\|X_i\|^2.$$

Summing up the inequalities for all $M_i$'s we get the upperbound $O(\beta + \delta)$ needed for the theorem. □

Algorithmic version. Since Bansal et al. [BFK+11] give an efficient algorithm for constructing
orthogonal separators, the above proof can immediately be made algorithmic.

Corollary 3.9. There is an algorithm that given a weighted graph $G = (V, E)$ in which
$\phi_{\mathrm{local}} \ge \Omega\left(\frac{\sqrt{\log n \log r}}{\epsilon^{3/2}}\right)\phi_{\mathrm{sparsest}}$ computes a $(1+\epsilon)$-approximation to sparsest cut in time $2^{O(r)}\mathrm{poly}(n)$.
Here $\phi_{\mathrm{local}}$ is the minimum sparsity of sets of size at most $2n/r$.
In fact the algorithm outputs one of the following.

1. Either a subset with sparsity at most $(1+\epsilon)\phi_{\mathrm{SDP}}$;

2. Or a subset of size at most $\frac{2n}{r}$ with sparsity at most $O\left(\frac{\sqrt{\log n \log r}}{\epsilon^{3/2}}\right)\phi_{\mathrm{SDP}}$.

Here $\phi_{\mathrm{SDP}}$ is the optimum value of eq. (5) for $r + 3$ rounds.
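
The dependence on $\epsilon$ in Corollary 3.9 can be traced back to Theorem 3.4 and Lemma 3.3. The following is a sketch of that parameter calculation under assumptions of ours, not stated in this form in the text: $\delta = \beta = \Theta(\epsilon)$, $|X| = n$, and $r \ge 1/\epsilon$ so that $\log m = O(\log r)$.

```latex
\begin{align*}
m &= \frac{10 r^2}{\delta} = O\!\left(\frac{r^2}{\epsilon}\right)
   \quad\Longrightarrow\quad \log m = O(\log r), \\
D &= O\!\left(\sqrt{\frac{\log |X| \,\log m}{\beta}}\right)
   = O\!\left(\sqrt{\frac{\log n \,\log r}{\epsilon}}\right), \\
\frac{D}{\delta} &= O\!\left(\frac{\sqrt{\log n \,\log r}}{\epsilon^{3/2}}\right).
\end{align*}
```

With these choices the hypothesis $\phi_{\mathrm{local}} \ge \Omega(\phi_{\mathrm{SDP}} D/\delta)$ of Theorem 3.4 becomes a requirement of order $\sqrt{\log n \log r}/\epsilon^{3/2}$, and the conclusion $O(\delta + \beta)$ becomes $O(\epsilon)$.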

Proof. (Sketch) Consider the algorithm from Theorem 2.1. If it outputs a partition, we are done.
Otherwise, we apply the algorithm for constructing orthogonal separators in [BFK+11] on the set
of vectors as constructed in Proposition 3.5. The above existence proof of the set $S$ fails for this
set of vectors, therefore Lemma 3.6 fails, and there must be a small set in the orthogonal separator
that has the desired expansion. □

4 Bounded Genus Graphs 

In this section, we prove an analog of our result for graphs with orientable genus $g$. The standard
LP relaxation [LR99] for sparsest cut on such graphs has an integrality gap of $O(\log g)$ [LS10].
For planar graphs (when $g = 0$), Park and Phillips [PP93] presented a weakly polynomial time
algorithm for the problem of edge expansion using dynamic programming.

Here we show how to give a $(1+\epsilon)$-approximation when the graph satisfies a certain local
expansion condition. Note that this expansion condition is true for instance in $O(1)$-dimensional
grids when $r = \mathrm{poly}(1/\epsilon)$.

Theorem 4.1. There is a polynomial-time algorithm that given a weighted graph G with orientable 
genus g in which 4>iocai > °i ^ <t>sparsest (where (piocai is the minimum sparsity of sets of size at 
most $n/r$) computes a $(1+\epsilon)$-approximation to sparsest cut and similar problems in time $2^{O(r)}\mathrm{poly}(n)$.
In fact the algorithm outputs one of the following.

1. Either a subset with sparsity at most $(1+\epsilon)\phi_{\mathrm{SDP}}$;

2. Or a subset of size at most ^ with sparsity at most ^ °i^ (psDP- 
Here $\phi_{\mathrm{SDP}}$ is the optimum value of eq. (5) for $r + 2$ rounds.



Before proving Theorem 4.1, let us first recall the theory of random partitions of metric spaces,
and its specialization to graphs of bounded genus. If $(V, d)$ is a metric space then a padded decomposition
at scale $\Delta$ is a distribution over partitions $P$ of $V$ where each block of $P$ has diameter at most $\Delta$.
Its padding parameter is the smallest $\beta \ge 1$ such that the ball of radius $\Delta/\beta$ around a point has a
good chance of lying entirely in the block containing the point:

$$\Pr_{P}\left[B_d(u, \Delta/\beta) \subseteq P(u)\right] \ge \frac{1}{2} \quad \text{for all } u \in V. \qquad (3)$$
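
Condition (3) can be evaluated empirically for an explicit family of partitions. The sketch below is a toy illustration on a path metric, where the distribution is uniform over shifted interval partitions; the function `padded_fraction` and the chosen parameters are our own and are not part of the constructions cited in the text.

```python
def padded_fraction(points, dist, partitions, radius):
    """For each point u, the fraction of partitions in which the closed ball
    B(u, radius) lies entirely inside the block containing u.

    `partitions` is a list of dicts mapping each point to its block id;
    the list is treated as a uniform distribution over partitions."""
    result = {}
    for u in points:
        ball = [v for v in points if dist(u, v) <= radius]
        good = sum(1 for block_of in partitions
                   if all(block_of[v] == block_of[u] for v in ball))
        result[u] = good / len(partitions)
    return result

# Path metric on 0..23; blocks are intervals of 8 consecutive points (diameter 7),
# shifted by every possible offset.
points = list(range(24))
block_len = 8
partitions = [{x: (x + s) // block_len for x in points} for s in range(block_len)]

fractions = padded_fraction(points, lambda a, b: abs(a - b), partitions, radius=2)
print(min(fractions.values()))
```

With scale $\Delta = 8$ and radius $\Delta/\beta = 2$, every point is padded in at least half of the partitions, so this family satisfies (3) with $\beta = 4$ on this particular metric.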
The padding parameter of a graph $G$ is the smallest $\beta$ such that every semimetric formed by
weighting the edges of $G$ has a padded decomposition with padding parameter at most $\beta$. The
following theorems are known.

Theorem 4.2. 1. [LS10] If $G$ has orientable genus $g$, then its padding parameter is $O(\log g)$.

2. [FT03] If $G$ has no $K_{p,p}$ minor, then its padding parameter is $O(p^2)$.

Our main technical lemma is the following. 

Lemma 4.3. Given a graph $G = (V, E)$ and positive integer $r$, there exists an algorithm which
runs in time $2^{O(r)}\mathrm{poly}(n)$ and outputs one of the following for any $\epsilon > 0$:

1. Either a subset with sparsity at most $(1+\epsilon)\phi_{\mathrm{SDP}}$ where $\phi_{\mathrm{SDP}}$ is the optimum value of eq. (5),

2. Or a subset of size at most - with sparsity at most —^4>sdp- 

Proof. The idea is to apply the algorithm from Theorem 2.1. If it finds a cut of sparsity $(1+\epsilon)\phi_{\mathrm{SDP}}$,
then we are done. Otherwise let $[X_u]_u$ be the vectors output by it. We show how to use padded
decompositions of the shortest-path semimetric given by distances $\|X_u - X_v\|^2$ and then produce
a small nonexpanding set.

Let V denote the average squared length of these vectors, i.e. ^ — \ X^m II^«IP so that v = 

/i(l — ^). Choose A at least | " ^ . Take a padded decomposition at scale A and pick a random 

partition P out of it. 

Claim: The expected number of nodes that lie in subsets of size less than n/r in P is at least 

1 V^ II V l|2 

2 Z-^u \\^u\\ ■ 

Proof For each subset S G P with size |5| > -, if we choose an arbitrary t £ S, eq. (2) implies 
that: 

u 5eP:|5|>^«e5 TeP:|T|<5uGT 

< E ^1^1+ E e™'4EW+ E 1^1- 

S€P:\S\>^ TeP:|T|<^MeT u T£P:\T\<^ 

M TeP:|T|<^ 

D 

Now we choose a threshold $\tau \in [0, \Delta/\beta]$ uniformly at random. Then for each $T \in P$ with
$|T| \le n/r$, let $T' \subseteq T$ be the subset of nodes which are in the same partition block as the ball
of radius $\tau$ around them. We output such $T'$ with minimum sparsity among all $T \in P$ with
$|T| \le n/r$. Using standard arguments, we can prove that any pair of nodes $u$ and $v$ is separated
with probability at most $O(\beta\|X_u - X_v\|^2/\Delta)$. This means the total expected capacity cut will be at most
$O(\beta/\Delta)\sum_{u<v} c_{uv}\|X_u - X_v\|^2$. Moreover eq. (3) implies that:
E 



y ifil>- y |ri>-yiix„i|2. 

TeP:\T\<n/r TeP:\T\<n/r u 



Putting all together, we see that there exists some T ^ P with \T\ < ^ such that 

(Pg[-L ) < ^^(pSDP- □ 

Combining Lemma 4.3 with the bounds from Theorem 4.2 immediately implies Theorem 4.1. 

5 Small-set Expander Flows 

In [ARV09], expander flows are used as approximate certificates for expansion, which work for all
values of expansion. (By contrast, the eigenvalue or spectral bound of Alon-Cheeger is most useful
only for expansion close to $\Omega(1)$.) This section concerns small-set expander flows (SSE flows) which
can be viewed as approximate certificates of the expansion of small sets. An $(r, d, \beta)$-SSE flow is a
multicommodity flow in which small sets $S$ (i.e. sets of size at most $n/r$ for some small $r$) have $\beta d|S|$
outgoing flow where $\beta$ is close to $\Omega(1)$. The flow is undirected, and the amount of flow originating
at every node is at most $d$. Since the flow resides in the host graph and $\beta d|S|$ flow leaves every
small set $S$, an $(r, d, \beta)$-SSE flow is trivially a certificate that small sets have edge expansion $\Omega(d\beta)$
in the host graph.
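
On very small instances the defining property of an SSE flow can be checked by enumeration. The sketch below assumes the flow is summarized by a symmetric demand matrix `f`, where `f[i][j]` is the amount of flow routed between nodes `i` and `j`; this representation and the function name `is_sse_flow` are our own simplifications, and routing inside the host graph is not modelled. The enumeration is exponential in $n/r$, which is consistent with the remark in this section that verifying the property seems difficult in general.

```python
from itertools import combinations

def is_sse_flow(f, r, d, beta, tol=1e-9):
    """Brute-force check of the (r, d, beta)-SSE property on a demand matrix f:
    every node has at most d units of flow, and every set S with |S| <= n/r
    has at least beta * d * |S| units of flow leaving it."""
    n = len(f)
    if any(sum(row) > d + tol for row in f):
        return False
    for k in range(1, n // r + 1):
        for S in combinations(range(n), k):
            inside = set(S)
            outgoing = sum(f[i][j] for i in S for j in range(n) if j not in inside)
            if outgoing < beta * d * k - tol:
                return False
    return True

# Complete demand graph on 6 nodes with degree d = 1: each pair carries 1/5.
n, d = 6, 1.0
f = [[0.0 if i == j else d / (n - 1) for j in range(n)] for i in range(n)]
print(is_sse_flow(f, r=2, d=d, beta=0.6), is_sse_flow(f, r=2, d=d, beta=0.7))
```

In this example a set of size 3 has $3 \cdot 3 \cdot 0.2 = 1.8$ units of outgoing flow, so the property holds for $\beta = 0.6$ and fails for $\beta = 0.7$.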

Of particular interest here will be a surprising connection between SSE flows and finding near-optimal
SPARSEST CUT. In other words, information about expansion of small sets can be leveraged
into knowledge about the expansion of all sets. We note that such a leveraging was already shown
in [ABS10] using spectral techniques, but only when small set expansion is $\Omega(1)$, roughly speaking
(the reason is that the proof is Cheeger-like).

We note that given a flow it seems difficult (as far as we know) to verify that it is an SSE flow. 
Thus we will also be interested in a closely related notion of spectral SSE flow, which by contrast 
is easily recognized using eigenvalue computation. This is the one used in our algorithm. 

Definition 5.1 (Spectral SSE Flow). A $(r, d, \lambda)$-spectral SSE flow is a multicommodity flow whose
vertices have degree between $d/2$ and $d$, and the $r$-th smallest eigenvalue of its Laplacian matrix is
at least $d\lambda$.

The relationship between the two types of flow relies upon the so-called higher order Cheeger
inequalities [LRTV12, LGT12].

Theorem 5.2 (Rough statement). If the graph has an $(r, d, \beta)$ SSE flow then it also has a
$(2r, d, \Omega(\beta^2/\log r))$ spectral SSE flow. Conversely, if the graph has an $(r, d, \lambda)$ spectral SSE flow
then it has a weaker version of $(r, d, \beta = \lambda)$ combinatorial SSE flow.






See Lemma B.22 and Lemma B.24 for more precise statements. 

Now we describe how these results are useful. First, just existence of SSE flows is enough 
to imply a low integrality gap for the Lasserre relaxation. This is reminiscent of primal-dual 
frameworks (e.g., expander flows being a family of dual solutions for the ARV SDP relaxation and 
thus giving a lower bound on the optimum) but we don't know how to make that formal yet. 

Theorem 5.3. If a {r,d, \)- spectral SSE flow exists in the graph for d\ ^ -(^sparsest, then the 
GS rounding algorithm computes a $(1+\epsilon)$-approximation to sparsest cut when applied on the
$O(r/\epsilon)$-level Lasserre solution. In particular, the integrality gap of the Lasserre relaxation is at
most $(1+\epsilon)$.

The other result is a more direct approximation algorithm that does not use SDP hierarchies
at all. Instead it uses a form of spectral rounding (as in [ABS10]) that produces a set with low
symmetric difference to the optimum sparsest cut, followed by the clever idea of Andersen and
Lang [AL08] to purify this set into a bona fide cut of low expansion.

Theorem 5.4. There is a $2^{O(r)}\mathrm{poly}(n)$ time algorithm that, given a graph and an $(r, d, \lambda)$-spectral
SSE flow for $d\lambda \gg \frac{1}{\epsilon^2}\Phi_{sparsest}$, outputs a cut of sparsity at most $(1 + \epsilon)\Phi_{sparsest}$.

The above two theorems become important only because of the following two theorems which 
concern the existence of the flow. 

Theorem 5.5. If $d \ll \Phi_{local}\sqrt{\log r}/\sqrt{\log n}$ then the graph has an $(r, d, \Omega((\log r)^{-2}))$ SSE flow.

This theorem follows from Lemma B.19 and Lemma B.20. 

Theorem 5.6. If $d \ll \Phi_{local}\sqrt{\log r}/\sqrt{\log n}$ then the graph has a $(2r, d, \Omega((\log r)^{-5}))$ spectral SSE
flow. Furthermore, a $(4r, d, \Omega((\log r)^{-5}))$ spectral SSE flow can be found in polynomial time.

This theorem follows from Theorems 5.2 and 5.5 and Lemma B.25. The algorithm to find the
spectral SSE flow uses the fact that maximizing the sum of the first $r$ eigenvalues of a matrix is a
convex objective.

In fact, when $\Phi_{local}$ is small, we can actually find a small set that does not expand well.

Theorem 5.7. For any graph $G = (V, E)$ and any value $d$, there is a polynomial time algorithm
that either finds a $(4r, d, \Omega((\log r)^{-5}))$ spectral SSE flow, or finds a set of size at most $100n/r$ that
has expansion at most $O(d\sqrt{\log n}/\sqrt{\log r})$.

For more details see Lemma B.26. 

5.1 Overview of proof of existence of SSE flows 

To keep the main paper relatively concise, we have moved the proof of existence to the appendix
and give an overview here.

From a distance, the existence proof for SSE flows uses similar ideas as the one for expander
flows in [ARV09]: we write an exponential size LP that is feasible iff the desired flow exists, and
then reason about the properties of dual solutions (using properties of flows, cuts, and $\ell_2^2$ metrics)
to show that the LP is feasible.

We write an LP that enforces that each vertex has degree at most $d$ in the flow, and that for every set $S$
of size $n/3r$ to $n/r$, the amount of outgoing flow is at least $\beta d|S|$; the precise LP can be found in
Appendix B.1.

The dual of this LP consists of a nonnegative weight $s_i$ for all vertices and $w_e$ for each edge, and
also a nonnegative weight for every set of size between $n/3r$ and $n/r$. We shall prove the following
lemma:

Lemma 5.8 (imprecise). Given a valid dual solution with degree $d$ and $\beta$ parameter $= O((\log r)^{-2})$,
there is an algorithm that finds a set of size at most $100n/r$ with expansion $O(d\sqrt{\log n}/\sqrt{\log r})$.

In order for the algorithm to run in polynomial time, we first need to represent the LP dual
concisely, and as stated above it involves a nonnegative weight on exponentially many cuts! As in
ARV, this concise representation is possible since a nonnegative weighting of cuts is an $\ell_1$ metric
and the algorithm is only interested in the "distance" between two vertices in this metric (which
is the measure of sets that contain one of the vertices but not the other). The $\ell_1$ metric can be
concisely represented by some $\ell_2^2$ vectors; see Appendix B.2.1 and Lemma B.26.

The proof of the Lemma above uses the "chaining" idea from [ARV09], but there are many 
differences which we list here. 

(a) The proof is handicapped since it is only allowed to use local expansion (i.e., expansion of sets of
size at most $O(n/r)$), and this requires us to invent novel ways of applying the region-growing
framework in [LR99] (see Appendix B.2.3). Many steps in our algorithms rely on such region
growing arguments, including Lemmas B.6, B.16 and B.18.

(b) In [ARV09] all vectors have unit norm; here, however, the $\ell_2^2$ vectors can have different norms.
We use a known reduction that transforms the vectors for a large subset of vertices, so that
they are in a sphere of fixed radius. See Appendix B.2.4.

(c) The existence of matching covers used in the ARV proof is unclear and has to be carefully
established. This uses a certain "spreading constraint" that holds for $\ell_2^2$ metrics supported on
small sets. See Lemma B.15. Also, a matching cover may not exist because a set of vertices
is far away from other vertices in graph distance (distance according to the weights on edges).
We call such sets obstacle sets of type I, and use region-growing arguments to remove these
sets; see Appendix B.2.5.

(d) The crux of the ARV proof is to prove the existence of a special pair of vertices that are
close in graph metric (i.e., the metric given by the weights on the edges) and far apart in $\ell_2^2$
metric. From the existence proof and global expansion $\Phi_{global}$, one can immediately establish
the existence of $\Omega(n)$ such pairs, which is needed in the argument. The analogous idea does
not work here since the proof is handicapped by being restricted to only use local expansion.
However, we show that this step can only fail if there exists an obstacle set of type II. We design
another region-growing type argument to handle this; see Lemmas B.12, B.16 and B.18.

(e) The ARV argument uses the Alon-Cheeger inequality: for $d$-regular graphs, the second eigenvalue
of the Laplacian is $\Omega(1)$ iff the graph has expansion $\Omega(1)$. The analogous result for small set
expansion, the so-called "higher order Cheeger inequality," has only recently been established,
and only in one direction and in a weaker form [LRTV12, LGT12]. This weak form makes us
lose extra $\mathrm{poly}(\log r)$ factors in many theorems, which are potentially improvable. For details
see Lemma B.22.

5.2 Finding Sparsest Cut using SSE flow

Before we delve into the long proof of existence of SSE flows, we quickly show how they are useful 
in approximating SPARSEST CUT. As mentioned, there are two methods for this. 

5.2.1 Method 1: Using Lasserre Hierarchy Relaxation 

This will use a modification of an idea of Guruswami-Sinop which we now recall. Recall (see
Appendix A) that the solution for $r' + 2$ rounds of the Lasserre Hierarchy relaxation satisfies the
following property:

$$\frac{\sum_{u<v} C_{uv}\|X_u - X_v\|^2}{\sum_u \|X_u\|^2} = \frac{\mathrm{Tr}(X^T X L(G))}{\|X\|_F^2} = \Phi_{SDP},$$

where the approximation ratio is bounded by $\left(1 - \frac{\|X_S^{\perp} X\|_F^2}{\|X\|_F^2}\right)^{-1}$ over all sets $S$ of size $r'$ by Theorem 2.1.

Theorem 5.9 (Theorem 3.2 in [GS13]). Given positive integer $r > 0$ and positive real $\epsilon > 0$, the
above approximation ratio is upper bounded by $\left(1 - \frac{1}{1-\epsilon}\cdot\frac{\sum_{j \ge r+1}\sigma_j(X^T X)}{\|X\|_F^2}\right)^{-1}$ for $r' = r/\epsilon + r + 1$.

Proof. (Sketch) Using the column based low-rank matrix reconstruction error bound from [GS12b],
it can be shown that there exists a set $S$ of size $r' = r/\epsilon + r - 1$ such that the numerator $\|X_S^{\perp} X\|_F^2 \le
(1 - \epsilon)^{-1} \sum_{j \ge r+1} \sigma_j(X^T X)$, where $\sigma_j(X^T X)$ is the $j$-th largest eigenvalue of $X^T X$. □

In order to bound the sum of eigenvalues, the analysis in [GS13] uses von Neumann's trace
inequality, which we present in a slightly more general form:

Proposition 5.10. For any matrix $Y \succeq 0$ and positive integer $r$:

$$\sum_{i \ge r+1} \sigma_i(Y) = \min_{Z \succeq 0} \frac{\mathrm{Tr}(Y \cdot Z)}{\lambda_{r+1}(Z)}.$$

In the original analysis of [GS13], this claim is used with $Y = X^T X$ and $Z = L(G)$, whereupon
one obtains:

$$\frac{\sum_{i > r} \sigma_i(X^T X)}{\|X\|_F^2} \le \frac{\mathrm{Tr}(X^T X \cdot L(G))}{\lambda_{r+1}(G)\|X\|_F^2} \le \frac{\Phi_{SDP}}{\lambda_{r+1}(G)}.$$

Thus

$$\frac{\sum_{i \le r} \sigma_i(X^T X)}{\|X\|_F^2} \ge 1 - \frac{\Phi_{SDP}}{\lambda_{r+1}(G)}. \qquad (4)$$

Consequently, the rounding analysis in [GS13] requires a bound on the $\lambda_{r+1}$ value of the graph.

Our idea is to use Proposition 5.10 by substituting the Laplacian of the spectral SSE flow as $Z$
in the above calculation, and then use the lower bound on the $\lambda_r$ value of this flow Laplacian. This
uses the following easy lemma.

Lemma 5.11. If $X$ is as described above, then for any flow $F$ that lies in the host graph $G$:

$$\frac{\sum_{i > r} \sigma_i(X^T X)}{\|X\|_F^2} \le \frac{\Phi_{SDP}}{\lambda_{r+1}(F)}.$$

Proof. Since $F$ is routable in $G$ and $X \in \ell_2^2$, we have:

$$\mathrm{Tr}(X^T X \cdot L(F)) \le \mathrm{Tr}(X^T X \cdot L(G)) \le \Phi_{SDP}\|X\|_F^2.$$

Choosing $Y = X^T X$ and $Z = L(F)$, we see that Proposition 5.10 implies:

$$\frac{\sum_{i > r} \sigma_i(X^T X)}{\|X\|_F^2} \le \frac{\mathrm{Tr}(X^T X \cdot L(F))}{\lambda_{r+1}(F)\|X\|_F^2} \le \frac{\Phi_{SDP}}{\lambda_{r+1}(F)}.$$

Now Theorem 5.3 follows using Lemma 5.11 and eq. (4).

Remark: Note that we only need $\lambda_{r+1}(F)$ to be more than $\Phi_{SDP}$. Such flows could potentially
exist under more general conditions than our local expansion condition.
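To make the step from Lemma 5.11 to a $(1 + O(\epsilon))$ guarantee concrete, the following worked chain may help. It is only a sketch under an assumption introduced here for illustration: the flow satisfies $\lambda_{r+1}(F) \ge \Phi_{SDP}/\epsilon'$, where $\epsilon'$ is a symbol of ours (not from the text) with $\epsilon + \epsilon' < 1$.

```latex
\begin{align*}
\frac{\sum_{i>r}\sigma_i(X^TX)}{\|X\|_F^2}
  &\le \frac{\Phi_{SDP}}{\lambda_{r+1}(F)} \le \epsilon'
  && \text{(Lemma 5.11 and the assumption)}\\
\left(1 - \frac{1}{1-\epsilon}\cdot
      \frac{\sum_{j\ge r+1}\sigma_j(X^TX)}{\|X\|_F^2}\right)^{-1}
  &\le \left(1 - \frac{\epsilon'}{1-\epsilon}\right)^{-1}
  && \text{(ratio from Theorem 5.9)}\\
  &= 1 + \frac{\epsilon'}{1-\epsilon-\epsilon'} .
\end{align*}
```

In particular, taking $\epsilon = \epsilon' \le 1/4$ bounds the ratio by $1 + 2\epsilon$, which is the shape of guarantee claimed in Theorem 5.3 up to the constant in front of $\epsilon$.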

5.2.2 Method 2: Using Subspace Enumeration and Cut Improvement 

We show that given an $(r, d, \lambda)$ spectral SSE flow, where $d\lambda$ is much larger than the expansion $\Phi$ of
the sparsest cut, it is possible to use the eigenspace enumeration idea of [ABS10] together with the ideas
of [AL08] to get a good approximation to sparsest cut.

Lemma 5.12 (Eigenspace Enumeration, [ABS10]). There is a $2^{O(r)}n^{O(1)}$ time algorithm that,
given a graph whose $r$-th smallest eigenvalue (of Laplacian) is $\lambda_r > 20\Phi/\epsilon$, outputs a set of subsets
$X \subseteq \{0, 1\}^n$ with the following guarantee: for every subset $S$ that has expansion $\Phi$, there is a vector
$x \in X$ such that

$$\frac{|x - 1_S|}{|1_S|} \le \frac{8\Phi}{\lambda_r}.$$

The above eigenspace enumeration allows us to compute a "guess" that has low symmetric 
difference with the optimum cut. Then we can use a simple version of cut improvement algorithm 
of [AL08] to improve it to a cut of low expansion. 

Lemma 5.13. There is a $2^{O(r)}n^{O(1)}$ time algorithm that, given a graph $G = (V, E)$ and an $(r, d, \lambda)$
spectral SSE flow embeddable in $G$, enumerates $2^{O(r)}n^{O(1)}$ sets with the following guarantee. For
any set $S$ of size at most $n/2$ that has expansion $\Phi(S) \ll d\lambda\epsilon\delta$ (for $\epsilon + \delta < 1$), there is a set $T$ in
the output such that $\frac{|S \Delta T|}{|S|} \le \delta$ and $\Phi(T) \le (1 + \epsilon)\Phi(S)$ ($\Delta$ denotes symmetric difference).

Proof. The capacity of flow that crosses $S$ in the spectral SSE flow can only be smaller than
$\Phi(S) \cdot |S|$ because the flow is embeddable in $G$. Hence when we apply Lemma 5.12 on the flow, we
know there is a vector $1_T$ in $X$ that is $\epsilon\delta/2$ close to the indicator vector of $S$.

Using this vector, suppose we know the expansion $\Phi(S)$ (later we shall see we only need to
know this value up to a multiplicative factor, so the algorithm will enumerate all possible values).
Construct a single commodity flow instance where we add a source $s$ and sink $t$ to the graph. For
each vertex $i \in T$, there is an edge from $i$ to sink $t$ with capacity $4\Phi(S)/\delta$. For each vertex $i \notin T$,
there is an edge from source $s$ to $i$ with capacity $4\Phi(S)/\delta$.

Now we find the min-cut that separates source $s$ and sink $t$. Since $T$ is close to $S$, we know
the capacity of this cut is at most $(1 + \epsilon/2)\Phi(S)|S|$ because $\{s\} \cup S$ achieves this capacity. Let the
vertices that are on the same side with the sink be $Q$; then we know $|Q \Delta T| \le \frac{(1 + \epsilon/2)\Phi(S)|S|}{4\Phi(S)/\delta} \le \frac{\delta|S|}{2}$.
Therefore $\frac{|Q \Delta S|}{|S|} \le \frac{|Q \Delta T| + |T \Delta S|}{|S|} \le \delta$.

On the other hand, the expansion of $Q$ is at most

$$\frac{(1 + \epsilon/2)\Phi(S)|S| - |Q \Delta T| \cdot 4\Phi(S)/\delta}{|S| - |S \Delta T| - |Q \Delta T|} \le \frac{(1 + \epsilon/2)\Phi(S) - 4x\Phi(S)/\delta}{(1 - \epsilon\delta/2) - x} \le (1 + \epsilon)\Phi(S)$$

(where in the second step, we substituted $x = \frac{|Q \Delta T|}{|S|}$). □
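The single-commodity construction in the proof above can be sketched in a few lines. This is a minimal illustration, not the algorithm of [AL08] itself: it assumes vertices are numbered $0, \ldots, n-1$, takes the terminal capacity (the quantity $4\Phi(S)/\delta$ in the text) as a plain parameter `penalty`, and uses a textbook BFS augmenting-path max-flow; the function name `min_cut_sink_side` is hypothetical.

```python
from collections import deque

def min_cut_sink_side(n, edges, guess, penalty):
    """Return the sink side Q of a minimum s-t cut in the auxiliary network.

    edges: list of (u, v, capacity) for the undirected host graph.
    guess: the set T; vertices in T are tied to the sink, all others to the source.
    penalty: capacity of every terminal edge (4 * Phi(S) / delta in the text).
    """
    s, t = n, n + 1
    cap = [dict() for _ in range(n + 2)]

    def add(u, v, c_uv, c_vu):
        cap[u][v] = cap[u].get(v, 0.0) + c_uv
        cap[v][u] = cap[v].get(u, 0.0) + c_vu

    for u, v, c in edges:
        add(u, v, c, c)
    for i in range(n):
        if i in guess:
            add(i, t, penalty, 0.0)
        else:
            add(s, i, penalty, 0.0)

    while True:
        # BFS for an augmenting path in the residual graph.
        parent = {s: None}
        queue = deque([s])
        while queue and t not in parent:
            u = queue.popleft()
            for v, c in cap[u].items():
                if c > 1e-12 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if t not in parent:
            break  # parent now holds exactly the source side of a min cut
        bottleneck, v = float("inf"), t
        while parent[v] is not None:
            bottleneck = min(bottleneck, cap[parent[v]][v])
            v = parent[v]
        v = t
        while parent[v] is not None:
            u = parent[v]
            cap[u][v] -= bottleneck
            cap[v][u] += bottleneck
            v = u
    return {i for i in range(n) if i not in parent}
```

On two 4-cliques joined by one edge, with a guess that wrongly includes one vertex of the other clique and `penalty = 1.0`, the returned set is exactly the clique, i.e. the guess is "purified" by dropping the misplaced vertex.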

Corollary 5.14. Given a graph $G = (V, E)$ and an $(r, d, \lambda)$ spectral SSE flow embeddable in $G$, there
is a $2^{O(r)}n^{O(1)}$ time algorithm that:

• Finds a set $S$ with $\Phi(S) \le (1 + O(\epsilon))\Phi_{sparsest}$ if $d\lambda \gg \Phi_{sparsest}/\epsilon^2$;

• Finds a set $S$ with $\Phi(S) \le (1 + O(\epsilon))\Phi_{global}$ if $d\lambda \gg \Phi_{global}/\epsilon$;

• Finds a set $S$ of size at least $cn/2$ such that $\Phi(S) \le (1 + O(\epsilon))\Phi_{c\text{-}balanced}$ if $d\lambda \gg \Phi_{c\text{-}balanced}/(c\epsilon)$.

Proof. (sketch) For sparsest cut, choose $\delta = \epsilon$ in Lemma 5.13. For edge expansion, choose $\delta = 1/2$.
For $c$-balanced separator, choose $\delta = c/2$. □

6 Conclusions 

The fact that it is possible to compute (1 + e)-approximation for sparsest cut on an interesting 
family of graphs seems very surprising to us. Further study of Guruswami-Sinop rounding also 
seems promising: our analysis is still not using the full power of their theorem. 

Our work naturally leads us to the following imprecise conjecture, which if true would yield 
immediate progress. 

Conjecture: (Imprecise) In "interesting" families of graphs, i.e., those where existing algorithms
for SPARSEST CUT fail, $\Phi_{local}/\Phi_{global}$ is large, say $> \sqrt{\log n}$.

As support for this conjecture we observe that if our algorithm does not beat $\sqrt{\log n}$-approximation
on some graph, then there is a constant $r$ and a set of size $n/r$ whose expansion is at least $\sqrt{\log n}$
times the optimum.

Furthermore, it is conceivable that SSE flows exist in graphs even when the local expansion
condition is not met. For our analysis of the rounding algorithm from [GS13] we only need the
existence of an SSE flow of degree say $> 1.1\Phi_{sparsest}$ (see Section 5.2.1). Conceivably such flows
exist in a wider family of graphs, and this could be another avenue for progress.

A Overview of Lasserre Hierarchy Relaxation for Sparsest Cut and Rounding

In this section, we will give a brief description of Lasserre Hierarchy relaxation for Uniform Sparsest 
Cut problem, the rounding algorithm of [GS13] and its analysis. 

We present the formal definitions of Lasserre Hierarchy relaxations [Las02], tailored to the
setting of the problems we are interested in, where the goal is to assign to each node in $V$ a label
from $\{0, 1\}$.

Definition A.1 (Lasserre vector set). Given a set of variables $V$ and a positive integer $r$, a collection
of vectors $x$ is said to satisfy $r$ rounds of Lasserre Hierarchy, denoted by $x \in \mathrm{Lasserre}_r(V)$, if it
satisfies the following conditions:

1. For each set $S \in \binom{V}{\le r+1}$, there exists a function $x_S : \{0, 1\}^S \to \mathbb{R}^{\Upsilon}$ that associates a vector of
some finite dimension $\Upsilon$ with each possible labeling of $S$. We use $x_S(f)$ to denote the vector
associated with the labeling $f \in \{0, 1\}^S$. For singletons $u \in V$, we will use $x_u$ and $x_u(1)$
interchangeably. For $f \in \{0, 1\}^S$ and $v \in S$, we use $f(v)$ as the label $v$ receives from $f$. Also
given sets $S$ with labeling $f \in \{0, 1\}^S$ and $T$ with labeling $g \in \{0, 1\}^T$ such that $f$ and $g$
agree on $S \cap T$, we use $f \circ g$ to denote the labeling of $S \cup T$ consistent with $f$ and $g$: if $u \in S$,
$(f \circ g)(u) = f(u)$ and vice versa.

2. $\|x_\emptyset\|^2 = 1$.

3. $\langle x_S(f), x_T(g) \rangle = 0$ if there exists $u \in S \cap T$ such that $f(u) \ne g(u)$.

4. $\langle x_S(f), x_T(g) \rangle = \langle x_A(f'), x_B(g') \rangle$ if $S \cup T = A \cup B$ and $f \circ g = f' \circ g'$.

5. For any $u \in V$, $\sum_{j \in \{0,1\}} \|x_u(j)\|^2 = \|x_\emptyset\|^2$.

6. (implied by above constraints) For any $S \in \binom{V}{\le r+1}$, $u \in S$ and $f \in \{0, 1\}^{S \setminus \{u\}}$, $\sum_{g \in \{0,1\}^{u}} x_S(f \circ g) = x_{S \setminus \{u\}}(f)$.

One can view ||xs'(/)|p as the "probability" of /, in which case the corresponding "conditional" 
probabilities are given by \\ it\\Yi ■ Our relaxation is the following: 



$$\min \; \frac{1}{n\mu(1-\mu)} \sum_{u<v} C_{u,v}\|x_u - x_v\|^2$$

$$\text{s.t.} \quad \sum_u x_u = n\mu\, x_\emptyset, \qquad (5)$$

$$\|x_\emptyset\|^2 = 1, \quad x \in \mathrm{Lasserre}_{r'+2}(V), \quad \mu \in \{1/n, 2/n, \ldots, 1/2\}.$$

Note that we can easily eliminate the variable $\mu$ from eq. (5) by enumerating over all $\mu$ values. For
the special case of uniform sparsest cut, the rounding algorithm from [GS13] can be summarized
as follows. Given a feasible solution of eq. (5):

1. Let $X_u = x_u - \frac{1}{n}\sum_v x_v = x_u - \mu x_\emptyset$.

Observe (i) $\|X_u - X_v\|^2 = \|x_u - x_v\|^2$; (ii) $\sum_u X_u = 0$; (iii) $\sum_u \|X_u\|^2 = \frac{1}{n}\sum_{u<v} \|x_u - x_v\|^2 = n\mu(1-\mu)$; (iv) $1 - \mu \ge \|X_u\|^2 \ge \mu^2$.

2. Choose a set $S$ of $r'$ nodes using column selection [GS12b, DR10] from $[X_u]_u$.

3. Sample $f : S \to \{0, 1\}$ with probability proportional to $\|x_S(f)\|^2$.

4. Perform threshold rounding using the "conditional probabilities" assigned for each node $u \in V$,
which are proportional to $\langle x_S(f), x_u \rangle$.

B Constructing SSE Flows 

B.1 Definition and LP Formulation of SSE Flows

Recall that for any graph $G = (V, E)$, a multicommodity flow in $G$ assigns demand $\delta_{i,j}$ for pairs of
vertices $i, j$, and simultaneously routes $\delta_{i,j}$ units of flow from $i$ to $j$ for all pairs while satisfying
capacity constraints. In particular, if we use $f_p$ to denote the amount of flow routed along path
$p$, and $P_{ij}$ to denote all paths from $i$ to $j$, then a multicommodity flow should satisfy the following
constraints:

$$\forall i, j \in V: \quad \sum_{p \in P_{ij}} f_p = \delta_{i,j} \qquad (6)$$

$$\forall e \in E: \quad \sum_{p \ni e} f_p \le c_e \qquad (7)$$

We shall only consider symmetric flows (i.e. $\delta_{i,j} = \delta_{j,i}$). For a multicommodity flow, we call
$\delta_i = \sum_{j \in V} \delta_{i,j}$ the degree of vertex $i$. Now we can define SSE flows:

Definition B.1 (SSE flow). An $(r, d, \beta)$-SSE flow is a multicommodity flow whose vertices have
degree at most $d$, and for any set $S$ of size at most $n/r$, we have

$$\sum_{i \in S, j \notin S} \delta_{i,j} \ge \beta d|S|.$$

For a flow, we will define the expansion of a set $S$ to be $\frac{\sum_{i \in S, j \notin S} \delta_{i,j}}{d|S|}$, so the requirement of SSE
flow is just that the expansion of all small sets should be at least $\beta$.
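On very small instances the definition above can be checked directly by enumerating all small sets. The following brute-force sketch is only meant to make the two conditions explicit; it assumes the flow is given as a symmetric demand matrix with zero diagonal and does not verify routability in the host graph. The function name `is_sse_flow` is hypothetical, and the running time is exponential in $n$ (which is in line with the remark in the text that SSE flows seem hard to recognize).

```python
from itertools import combinations

def is_sse_flow(demand, r, d, beta):
    """Brute-force check of the (r, d, beta)-SSE flow conditions.

    demand[i][j] is the demand routed between i and j (assumed symmetric,
    zero diagonal, and already routable in the host graph).
    """
    n = len(demand)
    # Degree condition: every vertex has degree at most d.
    if any(sum(row) > d + 1e-12 for row in demand):
        return False
    # Expansion condition: every set of size at most n/r sends out
    # at least beta * d * |S| units of demand.
    for size in range(1, n // r + 1):
        for subset in combinations(range(n), size):
            inside = set(subset)
            crossing = sum(demand[i][j]
                           for i in inside for j in range(n) if j not in inside)
            if crossing < beta * d * size - 1e-12:
                return False
    return True
```

For unit demands between all pairs of four vertices with $r = 2$ and $d = 3$, singletons send out 3 units and pairs send out 4, so the flow qualifies exactly when $\beta \le 2/3$.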

SSE flow is a complicated object, so we will also use two weaker versions of SSE flow. The first
one is especially useful for the LP formulation:

Definition B.2 (Weak SSE flow). An $(r, d, \beta)$ weak SSE flow is a multicommodity flow whose
vertices have degree at most $d$, and for any set $S$ of size between $n/3r$ and $n/r$, we have

$$\sum_{i \in S, j \notin S} \delta_{i,j} \ge \beta d|S|.$$

Notice that the idea of restricting set $S$ to have roughly size $n/r$ is also used in [ARV09] (where
in the LP formulation the sets have size $n/6$ to $n/2$). We will use the LP formulation for weak SSE
flows:

$$\forall i \in V: \quad \sum_{j} \sum_{p \in P_{ij}} f_p \le d \qquad (8)$$

$$\forall e \in E: \quad \sum_{p \ni e} f_p \le c_e \qquad (9)$$

$$\forall S \subset V,\ n/3r \le |S| \le n/r: \quad \sum_{i \in S, j \notin S} \sum_{p \in P_{ij}} f_p \ge \beta d|S| \qquad (10)$$

Later we show this LP is feasible for some values of $\beta$ and $d$ by showing that the dual LP is not
feasible. The dual LP is given by

$$\sum_{e} c_e w_e + d \sum_{i \in V} s_i < \beta d \sum_{S} z_S|S| \qquad (11)$$

$$\forall i, j,\ p \in P_{ij}: \quad \sum_{e \in p} w_e + s_i + s_j \ge \sum_{S : |S \cap \{i,j\}| = 1} z_S \qquad (12)$$

$$z_S, w_e, s_i \ge 0 \qquad (13)$$

Here $s_i$ ($i \in V$), $w_e$ ($e \in E$) and $z_S$ ($S \subset V$, $n/3r \le |S| \le n/r$) are the dual variables corresponding
to eqs. (8), (9) and (10) respectively. Since the dual LP is homogeneous, all variables can be scaled
simultaneously without affecting the validity of a solution. Throughout this section we shall always
assume the following normalization:

$$\sum_{S} z_S|S| = n \qquad (14)$$

We would like SSE flows to serve as approximate certificates of small set expansion. However, small
set expansion is in general hard to certify, even if the expansion is close to 1. Hence we give a
weaker (but still useful) form, spectral SSE flow, which can be easily certified, and is closely related
to combinatorial SSE flows by the higher order Cheeger inequality [LRTV12, LGT12].

Definition B.3. An $(r, d, \lambda)$ spectral SSE flow is a multicommodity flow whose vertices have degree
between $d/2$ and $d$, and the $r$-th smallest eigenvalue of its Laplacian is at least $d\lambda$.

B.2 Existence of SSE Flows 

The main result of this Section is the existence of weak SSE flows. 

Lemma B.4. For any graph $G = (V, E)$ and any $d > 0$, either there is a set of size at most
$100n/r$ that has expansion smaller than $d\sqrt{\log n}/\sqrt{\log r}$, or there exists an $(r, d, \Omega(\log^{-2} r))$ weak SSE
flow embeddable in $G$.

B.2.1 $\ell_2^2$ Mapping of $z_S$

In order to show that weak SSE flows exist, we argue that the dual LP does not have any valid
solution. In fact, we show something even stronger: given any dual solution that satisfies eq. (11),
eq. (13) and eq. (14), there is a polynomial time algorithm that either finds a nonexpanding set of
size at most $100n/r$, or finds a path where eq. (12) is violated.

The dual solution has exponentially many variables. We shall use a compressed description
that is good enough for the algorithm: all the variables $z_S$ are mapped to $n$ vectors $Z_1, \ldots, Z_n$, such
that for all $i, j$, $\|Z_i - Z_j\|_2^2 = \sum_{S : |S \cap \{i,j\}| = 1} z_S$. This is possible because the $Z_i$'s can have one coordinate
for each set $S$: if $i$ is in $S$ then the coordinate is $\sqrt{z_S}$, otherwise the coordinate is just 0.
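The coordinate-per-set construction just described is easy to verify on a toy instance. The sketch below builds the vectors explicitly from a short list of weighted sets and compares squared distances with the total weight of sets separating two vertices; the helper names `cut_vectors`, `sq_dist` and `separating_weight` are hypothetical, and an explicit list of sets is of course only feasible when few sets carry weight.

```python
import math

def cut_vectors(n, weighted_sets):
    """Map each vertex i to Z_i with one coordinate per weighted set.

    weighted_sets: list of (S, z_S) pairs; the coordinate of Z_i for S is
    sqrt(z_S) if i is in S and 0 otherwise.
    """
    return [[math.sqrt(z) if i in s else 0.0 for s, z in weighted_sets]
            for i in range(n)]

def sq_dist(a, b):
    """Squared Euclidean distance between two coordinate lists."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def separating_weight(i, j, weighted_sets):
    """Total weight of the sets containing exactly one of i and j."""
    return sum(z for s, z in weighted_sets if (i in s) != (j in s))
```

With `weighted_sets = [({0, 1}, 0.5), ({1, 2}, 2.0), ({3}, 1.0)]` the two quantities agree for every pair of vertices, and `sq_dist(Z[i], [0.0] * 3)` recovers the total weight of the sets containing `i`.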



The upper bound on the size of $S$ implies that for any $i$, there are at most $Cn/r$ points within
$\ell_2^2$ distance $\|Z_i\|_2^2/C$ for all $C > 1$. This is because for any set $S$ of $Cn/r$ points, if we pick
a random point $j$ in $S$, the expected distance between $i$ and $j$ is at least
$E_{j \in S}[\|Z_i - Z_j\|_2^2] \ge \sum_{T \ni i} z_T \Pr[j \notin T] \ge \sum_{T \ni i} z_T/C = \|Z_i\|_2^2/C$. To avoid giving out exponentially
many variables, the dual solution will only contain $Z_i$'s that satisfy this property.

We shall rewrite the constraints for the $Z_i$'s:

$$\sum_{e} c_e w_e + d \sum_{i \in V} s_i < \beta d n \qquad (15)$$

$$z_S, w_e, s_i \ge 0 \qquad (16)$$

$$\sum_{i \in V} \|Z_i\|_2^2 = n \qquad (17)$$

$$\forall i \in V: \quad |\{j : \|Z_j - Z_i\|_2^2 \le \|Z_i\|_2^2/5\}| \le 5n/4r \qquad (18)$$

$$\forall i \in V: \quad \|Z_i\|_2^2 \le 3r \qquad (19)$$

$$\forall i, j \in V,\ p \in P_{ij}: \quad \sum_{e \in p} w_e + s_i + s_j \ge \|Z_i - Z_j\|_2^2 \qquad (20)$$

The last constraint is called the spreading constraint. A candidate dual solution is just a set of
variables $s_i$ ($i \in V$), $w_e$ ($e \in E$), $Z_i$ ($i \in V$) that satisfies these constraints.

Consider the $w_e$'s as edge distances on the graph, and let $d_{ij}$ be the shortest path distance between
$i$ and $j$ with weights $w_e$. From now on we refer to this weighted distance as the graph distance.
Intuitively, given a candidate dual solution, the algorithm either finds a nonexpanding set or finds
two vertices that have small $s_i$'s, small distance in graph distance and large distance in $\ell_2^2$ distance.
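Since the path constraint only needs to be checked on shortest paths, a pair that is close in graph distance, has small $s$ values and is far apart in the vector metric can be searched for directly. The sketch below does this with Floyd-Warshall on a small instance; it is an illustration of what "violating the path constraint" means, not the paper's algorithm (which must also handle the case where no such pair is visible), and the names `shortest_paths` and `violated_pairs` are hypothetical.

```python
def shortest_paths(n, weighted_edges):
    """All-pairs graph distances under edge lengths w_e (Floyd-Warshall)."""
    inf = float("inf")
    dist = [[0.0 if i == j else inf for j in range(n)] for i in range(n)]
    for u, v, w in weighted_edges:
        dist[u][v] = min(dist[u][v], w)
        dist[v][u] = min(dist[v][u], w)
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if dist[i][k] + dist[k][j] < dist[i][j]:
                    dist[i][j] = dist[i][k] + dist[k][j]
    return dist

def violated_pairs(n, weighted_edges, s, Z):
    """Pairs (i, j), i < j, with d_ij + s_i + s_j < ||Z_i - Z_j||^2."""
    dist = shortest_paths(n, weighted_edges)
    bad = []
    for i in range(n):
        for j in range(i + 1, n):
            gap = sum((a - b) ** 2 for a, b in zip(Z[i], Z[j]))
            if dist[i][j] + s[i] + s[j] < gap - 1e-12:
                bad.append((i, j))
    return bad
```

On a path 0-1-2 with edge lengths 0.1, vectors `[[0.0], [0.3], [1.0]]` and all $s_i = 0$, the pairs `(0, 2)` and `(1, 2)` violate the constraint; raising every $s_i$ to 0.5 removes both violations.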

B.2.2 Proof Idea 

Given the dual solution, we try to apply arguments similar to [ARV09] in order to find a pair of
vertices that are close in graph distance, but far in $\ell_2^2$ metric. When this pair of vertices also has
small $s_i$'s, it violates Equation (20), hence contradicting the feasibility of the dual solution.

In [ARV09] this proof goes by projecting all points along a random direction and arguing that
there must be many pairs of points that are close in graph distance but far in projection distance:
they call it a matching cover. However, constructing a matching cover in our setting is highly
nontrivial, because the proof is only allowed to use local expansion. We adapt the region-growing
argument in [LR99] in novel ways to solve this problem; see Appendix B.2.3.

The first difficulty in the argument is that each vertex might have very different $\|Z_i\|_2^2$ (that is,
they are in very different measure of sets in the dual solution). But this is easily fixed by embedding
the points into a single scale using ideas from [ALN08] (see Lemma B.6 in Appendix B.2.4).

It turns out that in order for a matching cover to not exist, one of the following two types of
obstacles must exist; a detailed discussion appears in Appendix B.2.5.

The first type of obstacle set is a set whose $D_0$-neighborhood in graph distance ($D_0$ is a parameter
that will be chosen later) contains only $O(n/r)$ points. Intuitively, this is an obstacle since
it would be hard to match these vertices to other vertices within graph distance $D_0$ because they
simply don't have enough neighbours. We show that the total volume of such sets cannot be too
large using the region-growing framework; see Lemma B.11.

The second type of obstacle set is a set of at most $10n/r$ vertices with large $s_i$ whose $D_0$
neighbourhood in graph distance contains only $O(n/r)$ points with small $s_i$. Intuitively such sets
are bad because we want to construct matching covers only on vertices with small $s_i$ (in order to
get the final contradiction with Equation (20)). Such sets would mean it is possible for a set $S$
with small $s_i$ to be only close to vertices with large $s_i$'s, and it would be impossible to match all
the vertices in $S$ with vertices with small $s_i$'s. We again use region-growing arguments to remove
such sets. The number of vertices removed cannot be large, because otherwise the sum of the $s_i$'s will
be too large and violate eq. (11) (see Lemma B.12).

Without these obstacle sets, it becomes possible to construct a matching cover (see Lemma B.16
in Appendix B.2.6). This matching cover allows us to adapt arguments in [ARV09] (see Lemmas
B.17 and B.18), and conclude that either there is a short path (in graph distance) that crosses
many cuts, or there is a nonexpanding set. The first case contradicts the validity of the
dual solution. In the second case we get a nonexpanding set, which again implies the existence of
obstacle sets of type I or II.

B.2.3 Region-growing Argument

As mentioned earlier, a key component of our proof is the region-growing argument from [LR99].
This argument applies to an undirected graph whose edges have arbitrary nonnegative capacities.
The goal (in [LR99]) is to give a partition into blocks that have low diameter (distance being
measured using edge weights) and on average have few edges crossing between the blocks. The
tradeoff between these two quantities is controlled by the expansion of the underlying unweighted
graph. Here we view this argument as giving an efficient partition oracle, which maintains a set of
vertices $V_i$ at step $i$ ($V_0 = V$). At step $i$ the oracle takes a set $S_i \subseteq V_{i-1}$ of size at least $n/F(r)$
where $F$ is a fixed polynomial, and then outputs $S'_i$, $S_i \subseteq S'_i \subseteq V_{i-1}$, and updates $V_i = V_{i-1} \setminus S'_i$.
There is a "center" $j \in S_i$ such that every other $j' \in S_i$ has distance at most $D_0$ to $j$ (we specify
$D_0$ later in Lemma B.5).

At step $t$, we say the partition maintained by the oracle is the collection of disjoint sets
$S'_1, S'_2, \ldots, S'_t, V_t$. The capacity of edges in the partition is always at most $n\alpha/(20\Delta\log 30r)$.

Lemma B.5 ([LR99]). Let $G = (V, E)$ be a graph with edge capacities $c_e$ and edge lengths $w_e$. Let
$W$ be the total weighted edge length: $W = \sum_{e \in E} c_e w_e$. Then for any polynomial $F(r)$, and any
$D_0 = C\Delta \log 30r \cdot \log r \cdot W/(n\alpha)$ (where $C$ is a constant depending on $F$), there is an efficient partition
oracle whose partitions always have capacity at most $n\alpha/(20\Delta\log 30r)$.

Proof. The proof is similar to Lemma 3 in [LR99]. However, there the region-growing procedure
starts from a single vertex (and the loss is $\log n$ because $n$ is roughly the ratio between the volume
of the graph and the volume of a single vertex). Here instead we start region-growing from the sets
given to the oracle. Because the sets all have large volume (more than $n/\mathrm{poly}(r)$), we lose only a
$\log r$ factor. □
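The idea of growing a region from a whole seed set rather than a single vertex can be illustrated with a simple radius sweep. The sketch below is a simplification made for illustration only: it runs Dijkstra from all seeds at once and then returns, among balls of radius at most a given bound, the one whose boundary has the smallest capacity. It does not implement the volume-based stopping rule that gives the guarantee in [LR99]; the function name `grow_region` and the adjacency format are assumptions.

```python
import heapq

def grow_region(adj, seeds, max_radius):
    """Sweep balls around a seed set and return (ball, boundary_capacity).

    adj: {u: [(v, length, capacity), ...]} for an undirected graph, with each
    edge listed in both directions. Only radii up to max_radius are considered.
    """
    # Multi-source Dijkstra under edge lengths, truncated at max_radius.
    dist = {s: 0.0 for s in seeds}
    heap = [(0.0, s) for s in seeds]
    heapq.heapify(heap)
    while heap:
        du, u = heapq.heappop(heap)
        if du > dist.get(u, float("inf")):
            continue
        for v, length, _cap in adj[u]:
            dv = du + length
            if dv <= max_radius and dv < dist.get(v, float("inf")):
                dist[v] = dv
                heapq.heappush(heap, (dv, v))
    # Try every distinct radius and keep the ball with the cheapest boundary.
    best_ball, best_cut = None, float("inf")
    for radius in sorted(set(dist.values())):
        ball = {u for u, du in dist.items() if du <= radius}
        cut = sum(cap for u in ball for v, _len, cap in adj[u] if v not in ball)
        if cut < best_cut:
            best_ball, best_cut = ball, cut
    return best_ball, best_cut
```

On a path 0-1-2-3 with unit lengths and capacities 5, 1, 5, starting from seed `{0}` with `max_radius = 2`, the sweep stops at the ball `{0, 1}`, whose boundary is the single light edge.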



B.2.4 Reducing to Single Scale 



The region growing argument applies to a particular scale $\Delta$. However, not all vertices have $\ell_2^2$
norm close to that scale. In this part we show how to reduce the problem to a single scale $\Delta$.

Lemma B.6. Given a dual solution with $\beta < C_\beta(\log r)^{-2}$, there is an algorithm that finds a
$\Delta$, and calls a partition oracle with scale $\Delta$ and size $F(r)$. After the algorithm, the number of
remaining vertices in the oracle is at least $\frac{n}{5\Delta\log 30r}$, and all but $n/F(r)$ of the remaining vertices
satisfy one of the two properties:

1. $\|Z_i\|_2^2 \ge \Delta/2$.

2. $s_i \ge D_2/10 = \Omega(\Delta/\log r)$.

The value $D_2$ comes from Lemma B.18 and will be $\Omega(\Delta/\log r)$.

First we use an averaging argument to find A. 

Lemma B.7. There exists some threshold $0.1 \le \Delta \le 3r$ such that the number of vertices with
$\|Z_i\|_2^2 \ge \Delta$ is at least $\frac{n}{4\Delta\log 30r}$.

Proof. We shall bucket the vertices according to $\|Z_i\|_2^2$. There will be $b = \lceil \log 30r \rceil$ buckets; the
$u$-th ($u \in \{1, \ldots, b\}$) bucket $B_u$ contains vertices with $\|Z_i\|_2^2$ in range $[0.1 \cdot 2^{u-1}, 0.1 \cdot 2^u)$. There
will be one extra bucket $B_0$ which contains vertices with $\|Z_i\|_2^2$ in range $[0, 0.1)$. By Equation (19),
$\|Z_i\|_2^2 \le 3r$ so these buckets cover all vertices.

We know $\sum_i \|Z_i\|_2^2 = n$; let $l_u = \sum_{i \in B_u} \|Z_i\|_2^2$, then $\sum_{u=0}^{b} l_u = n$. We also know $l_0 \le 0.1n$,
so there must be a bucket $B_u$ with $l_u \ge 0.9n/b$. Choose $\Delta = 0.1 \cdot 2^{u-1}$; we know the number of
vertices with $\|Z_i\|_2^2 \ge \Delta$ is at least the size of $B_u$, which is at least $l_u/(2\Delta) \ge n/(4\Delta b)$. □
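The bucketing in the proof above translates almost literally into code. The following sketch assumes the squared norms are given as a list of floats summing to $n$ with each entry at most $3r$, and that the logarithm is base 2 (the base is not stated explicitly in the text); the function name `pick_scale` is hypothetical.

```python
import math

def pick_scale(sq_norms, r):
    """Return (delta, count, b) following the bucketing argument.

    Bucket 0 holds norms in [0, 0.1); bucket u >= 1 holds norms in
    [0.1 * 2**(u-1), 0.1 * 2**u). delta is the lower end of the heaviest
    bucket among u >= 1, and count is the number of norms >= delta.
    """
    b = math.ceil(math.log2(30 * r))
    mass = [0.0] * (b + 1)
    for x in sq_norms:
        # Clamp to b so that a norm equal to the upper limit 3r stays in range.
        u = 0 if x < 0.1 else min(b, int(math.floor(math.log2(x / 0.1))) + 1)
        mass[u] += x
    heavy = max(range(1, b + 1), key=lambda u: mass[u])
    delta = 0.1 * 2 ** (heavy - 1)
    count = sum(1 for x in sq_norms if x >= delta)
    return delta, count, b
```

For instance, with $r = 1$ and ten norms `[0.05, 0.05, 0.3, 0.3] + [1.55] * 6` (which sum to 10), the heaviest bucket is $[0.8, 1.6)$, so $\Delta = 0.8$ and six vertices have norm at least $\Delta$, comfortably above the bound $n/(4\Delta b) = 0.625$.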

Now we are ready to prove Lemma B.6. 

Proof of Lemma B.6. Take the value $\Delta$ from Lemma B.7. Let $Q$ be the set of vertices whose $\|Z_i\|_2^2$
is at most $\Delta/2$ and $s_i$ is at most $D_2/10$. Let $B$ be the set of vertices whose $\|Z_i\|_2^2$ is at least $\Delta$.

If the size of $Q$ is at most $n/F(r)$, then the Lemma is true. Otherwise, use the partition oracle
to separate the set $Q$. From the oracle we get a $Q'$ which contains everything in $Q$. Consider any
vertex $i \in Q' \cap B$; by definition of the oracle we know there is a vertex $j \in Q$ such that $d_{ij} \le D_0$.
On the other hand, $\|Z_i - Z_j\|_2^2 \ge \|Z_i\|_2^2 - \|Z_j\|_2^2 \ge \Delta/2$. By eq. (20) we know $s_i \ge \Omega(\Delta)$; since
$\sum_{i \in V} s_i \le \beta n$, the size of $Q' \cap B$ is at most $\beta n/\Delta < |B|/50$. The size of the current set of the oracle
is at least $n/(5\Delta\log 30r)$. □

We shall also use the following Lemma from [ALN08, MN04] to project everything to a ball of
squared radius $\Delta$.

Lemma B.8 ([MN04]). There exists a mapping $T : \ell_2 \to \ell_2$ such that $\|T(z)\|_2 \le \sqrt{\Delta}$ for all $z \in \ell_2$
and for all $z, z' \in \ell_2$

$$\frac{1}{2} \le \frac{\|T(z) - T(z')\|_2}{\min\{\sqrt{\Delta}, \|z - z'\|_2\}} \le 1.$$
As a corollary, we now prove the following. 

Corollary B.9. There is a mapping that maps Zi to Xi, such that \\Xi\\2 = yA/2 for all i E V, 
and for all i,j G V 

1 A j A o r, 

8 min{A, \\Zi — Zj^^) 



Proof. Just let Xi = 4=T(Zi) © (y A — ||T(Zj)||2) where © denotes concatenation of vectors. It is 
easy to verify the claim. D 

After mapping all the $Z_i$'s to $X_i$'s, for a vertex $i$ with $\|Z_i\|_2^2 \ge \Delta/2$, vertices that are within
squared distance $\Delta/20$ in $X$ metric are also within squared distance $2\Delta/5$ in $Z$ metric (Corollary B.9).
By spreading constraints there can only be at most $5n/4r$ such vertices.

B.2.5 Obstacle Sets

The plan of the proof is to apply cover composition from [ARV09] in order to find a short path
(in graph metric) that crosses a lot of cuts. At any step $i$, let $V_i$ be the remaining vertices in the
partition oracle. Let $Q_i$ be the set of vertices in $V_i$ that have large $s$ values (at least $D_2/10$ as in
Lemma B.6). In this case there are two kinds of obstacle sets that prevent us from applying the
cover composition argument.

Definition B.10 (Obstacle Sets). At some step $i$ of the partition oracle, a set $S \subseteq V_i$ is an
obstacle set of type I if it has size at least $n/F(r)$, and the $D_0$ neighbourhood contains at most
$100n/r$ vertices in $V_i$.

A set $T \subseteq Q_i$ is an obstacle set of type II, if it has size at least $10n/r$, and the $D_0$ neighbourhood
contains at most $100n/r$ vertices in $V_i \setminus Q_i$.

Using region-growing arguments, we can remove the obstacle sets using the partition oracle 
without removing many vertices. 

Lemma B.11. For a partition oracle with distance $D_0$ as in Lemma B.6, if at some step $i$,
$\sum_{j \le i, |S'_j| \le 100n/r} |S'_j| \ge n/(10\Delta\log 30r)$, then one of the $S'_j$ of size at most $100n/r$ has expansion at
most $\alpha$.

In particular, there is a set $H \subseteq V_i$ whose size is at least $|V_i| - n/(10\Delta\log 30r)$, such that any
subset $S \subseteq H$ of size at least $n/F(r)$ expands to at least $100n/r$ vertices in $H$.

Proof. If we take the sum of capacity of all outgoing edges from these $S'_j$, each edge in the partition
is counted at most twice, therefore

$$\sum_{j \le i, |S'_j| \le 100n/r} |E(S'_j, V \setminus S'_j)| \le n\alpha/(10\Delta\log 30r).$$

On the other hand we know the sum of sizes is at least $n/(10\Delta\log 30r)$; by an averaging argument there
must be one set that has expansion at most $\alpha$. □

For proving Lemma B.19 this $\alpha$ will be chosen as $O(d\sqrt{\log n}/\sqrt{\log r})$.

Notice that it is very important that the algorithm always uses the partition oracle when it 
wants to remove a set of vertices. If the algorithm simply removes a set of vertices, it will be hard 
to bound the number of edges cut, and Lemma B.ll is no longer true. In this case we may have 
many obstacle sets of type I and cannot find a matching cover. 

For obstacle sets of type II, since a large fraction of the vertices in their neighbourhood have
large $s_i$, they cannot cover a lot of vertices without contradicting the validity of the dual solution.

Lemma B.12. Use the partition oracle in Lemma B.6 to remove obstacle sets of type II. At any
step, let $II$ be the set of steps where a set of type II is removed. Then $\sum_{j \in II} |S'_j| \le n/(20\Delta\log 30r)$.

Proof. By the definition of obstacle sets of type II, we know each $S'_j$ contains at least $10n/r$ vertices
with $s$-value at least $D_2/10$. On the other hand, it contains at most $90n/r$ vertices with $s$-value
smaller than $D_2/10$. Therefore a $1/10$ fraction of the vertices in $S'_j$ have $s$ value at least $D_2/10$:

$$\sum_{u \in S'_j} s_u \ge \frac{|S'_j|}{10} \cdot \frac{D_2}{10}.$$

On the other hand $\sum_{u \in V} s_u \le \beta n$, so when $\beta = C\log^{-2} r$ for small enough $C$ we know $\sum_{j \in II} |S'_j| \le
n/(20\Delta\log 30r)$. □

B.2.6 Gaussian Projections and Matching Covers

Recall the definitions of Matching Covers and Uniform Matching Covers in [ARV09]:

Definition B.13. A $(\sigma, \delta, c')$-matching cover of a set of points is a set $\mathcal{M}$ of matchings such that
for at least a fraction $\delta$ of directions $u$, there exists a matching $M_u \in \mathcal{M}$ of at least $c'n$ pairs of
points, such that each pair $(i, j) \in M_u$ is within graph distance $2D_0$, and satisfies

$$\langle X_i - X_j, u \rangle \ge 2\sigma\sqrt{\Delta}/\sqrt{d}.$$

The associated matching graph $M$ is defined as the multigraph consisting of the unions of all
matchings $M_u$.

Definition B.14. A set of matchings $\mathcal{M}$ $(\sigma, \delta)$-uniform-matching-covers a set of points $S$ if for
every unit vector $u$, there is a matching $M_u$ of $S$ such that every $(i, j) \in M_u$ is $2D_0$ close in graph
distance, satisfies $|\langle u, X_i - X_j \rangle| \ge 2\sigma\sqrt{\Delta}/\sqrt{d}$, and for every $i$, $\mu(u : i \text{ matched in } M_u) \ge \delta$.

Notice that in addition to the properties in [ARV09], we further require that every matched
pair must be close in graph distance.

Let $d$ be the dimension of the $X_i$'s and let $u$ be a uniformly random unit vector. When $d$ is large
enough, the following holds.

Lemma B.15. There exist thresholds $0 < \theta_1 < \theta_2$ with $\theta_2 - \theta_1 = \Omega(\sqrt{\log r}/\sqrt{d})$ and a polynomial
$G(r)$ such that the following holds. Let $Good(u)$ be the event that the number of vertices with projection
more than $\theta_1\sqrt{\Delta}$ is smaller than $5n/r$. For any vertex $i$ with $\|Z_i\|_2^2 \ge \Delta/2$,
$\Pr[u \cdot X_i > \theta_2\sqrt{\Delta} \text{ and } Good(u)] > 1/G(r)$.

Proof (sketch). Each such $i$ has at most $5n/4r$ nearby points. For points that are not close to $i$,
conditioned on $i$ having a large projection, the probability that they also have a fairly large projection
is very small. Hence, conditioned on $u \cdot X_i > \theta_2\sqrt{\Delta}$, the expected number of vertices that have a large
projection is small. By Markov's inequality, $\Pr[Good(u) \mid u \cdot X_i > \theta_2\sqrt{\Delta}] > 1/2$. □

Suppose we are given a set of vertices $V_t$ that can be partitioned into three parts $P, Q, R$, where all
vertices $i \in P$ have $\|Z_i\|_2^2 \ge \Delta/2$, all vertices $j \in Q$ have $s_j < D_2/10$, and $|R| < n/F(r)$ (this is exactly
what is guaranteed by Lemma B.6). We use the following algorithm to find matching covers:



Construct Cover $(P, Q, R)$

1. Pick a uniformly random unit vector $u$.

2. Let $Left = \{i : i \in P \text{ and } \langle X_i, u\rangle > \theta_2\sqrt{\Delta}\}$ and $Right = \{i : i \in P \text{ and } \langle X_i, u\rangle < \theta_1\sqrt{\Delta}\}$.

3. While there exists a pair $i \in Left$ and $j \in Right$ within graph distance $2D_0$:

4. Match $(i,j)$ and remove $i, j$ from $Left$, $Right$.

5. Fail if $|P \setminus Right| < 5n/r$ and the number of unmatched vertices in $Left$ is at least $n/F(r)$.
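
One round of Construct Cover can be sketched in code as follows. This is only an illustration under stated assumptions: the graph is given as an adjacency dictionary, graph distance is computed by breadth-first search, the event $|P \setminus Right| < 5n/r$ is evaluated before any matching is made, and all function and parameter names (`construct_cover`, `F_r`, `D0`, and so on) are hypothetical. Set $Q$ and $R$ play no role in the matching step itself, so they are omitted.

```python
import math
import random
from collections import deque


def random_unit_vector(dim, rng=random):
    """Uniformly random unit vector, obtained by normalising a Gaussian vector."""
    g = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(x * x for x in g))
    return [x / norm for x in g]


def ball(adj, source, radius):
    """Set of vertices within graph distance `radius` of `source` (BFS)."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        v = queue.popleft()
        if dist[v] == radius:
            continue
        for w in adj[v]:
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    return set(dist)


def construct_cover(adj, X, P, theta1, theta2, delta, D0, n, r, F_r, u=None):
    """One round of Construct Cover; returns (matching, unmatched_left, failed)."""
    if u is None:
        u = random_unit_vector(len(next(iter(X.values()))))
    scale = math.sqrt(delta)
    proj = {i: sum(a * b for a, b in zip(X[i], u)) for i in P}
    left = {i for i in P if proj[i] > theta2 * scale}
    right = {i for i in P if proj[i] < theta1 * scale}
    # Assumption of this sketch: the size test of step 5 uses Right before matching.
    few_large = len(set(P) - right) < 5 * n / r
    matching = []
    # Right only shrinks, so one greedy pass over Left yields a maximal matching.
    for i in sorted(left):
        candidates = ball(adj, i, 2 * D0) & right
        if candidates:
            j = min(candidates)
            matching.append((i, j))
            right.remove(j)
    unmatched = left - {i for i, _ in matching}
    failed = few_large and len(unmatched) >= n / F_r
    return matching, unmatched, failed
```

Passing an explicit direction `u` makes a round deterministic, which is convenient for checking small examples by hand.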

If the algorithm fails, the following lemma shows that we obtain an obstacle set of type I or II.

Lemma B.16. If Construct Cover fails, then it finds an obstacle set of type I or II.

Proof. Let $S$ be the set of vertices that are left unmatched in $Left$, and let $\Gamma_{D_0}(S)$ be the $D_0$-neighbourhood
of $S$ in $P \cup Q \cup R$. If $|\Gamma_{D_0}(S)| < 100n/r$, the first case of the lemma is satisfied.

If $|\Gamma_{D_0}(S)| > 100n/r$, consider two cases. If $|\Gamma_{D_0}(S) \cap P| > 80n/r$, then by a simple counting
argument there must be a point left in $Right$ that is close to some point in $S$, so these two
vertices can be matched, contradicting the assumption that $S$ is unmatched. When $|\Gamma_{D_0}(S) \cap P| < 90n/r$, let
$T = \Gamma_{D_0}(S) \cap Q$. Clearly $|T| > 10n/r$, and $\Gamma_{D_0}(T) \cap P \subseteq \Gamma_{2D_0}(S) \cap P$. The number of $2D_0$-neighbours
of $S$ in $P$ cannot be more than $90n/r$ (otherwise we would be able to find a matching
pair), hence $|\Gamma_{D_0}(T) \cap P| < 90n/r$ and $|\Gamma_{D_0}(T) \cap (P \cup R)| < 90n/r + n/F(r) < 100n/r$. □

[ARV09] has a Lemma that shows matching covers imply uniform matching covers. However 
in our situation, in order to apply the cover composition Lemma, we need a really large uniform 
matching cover, which is not guaranteed by the Lemma in [ARV09]. 

If Construct Cover fails only with inverse-polynomially small probability, then Lemma B.15 means the
matching cover is already "almost" uniform, in the sense that if we ignore the fact that $n/F(r)$
points will not be matched, each vertex will be in the matching with probability at least $1/G(r)$.

Lemma B.17. If Construct Cover fails with probability less than $1/(n^2 G(r))$, then there is a set
$W \subseteq P$ of size at least $|P| - 4|P|/r$ that is $(1/(G(r)\,r),\ \theta_2 - \theta_1)$ uniformly matching covered.

Proof. Consider the matching graph. First, pretend that even the $n/F(r)$ points that remain unmatched
are matched to something. In this case each vertex has degree at least $1/G(r)$
by Lemma B.15.

Now remove the edges that correspond to these unmatched vertices. In this step we remove at
most $|P|/(G(r)\,r)$ volume. Then we repeatedly remove any vertex that has degree at most $1/(G(r)\,r)$.
Again we remove at most $|P|/(G(r)\,r)$ volume, so the total volume removed is bounded by
$2|P|/(G(r)\,r)$.

However, each vertex in $P$ started with degree at least $1/G(r)$. Removing
$2|P|/(G(r)\,r)$ volume can reduce the degree of at most $4|P|/r$ vertices to below $1/(2G(r))$.
Therefore at most $4|P|/r$ vertices are removed. □
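
The last counting step is a volume argument. As a short worked inequality, with $k$ (a symbol introduced here) denoting the number of vertices whose degree drops from at least $1/G(r)$ to below $1/(2G(r))$, each such vertex must lose at least $1/(2G(r))$ volume:

```latex
% k: number of vertices whose degree falls below 1/(2G(r)); notation of this sketch only
k \cdot \frac{1}{2G(r)} \;\le\; \frac{2|P|}{G(r)\,r}
\quad\Longrightarrow\quad
k \;\le\; \frac{4|P|}{r}.
```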

B.2.7 Adapting ARV 

Using the uniform matching cover constructed above, and mechanisms in [ARV09], we can get the 
following Lemma. 

Lemma B.18. Suppose $W \subseteq V$ has a $(1/(G(r)\,r),\ \Omega(\sqrt{\log r}/\sqrt{d}))$ uniform matching cover. Then there
exists an algorithm that either finds $i, j \in W$ such that $d_{ij} < D_1 = O(D_0 \cdot \sqrt{\log n}/\sqrt{\log r})$ and
$\|X_i - X_j\|^2 > D_2 = \Omega(\Delta/\log r)$, or finds a set whose $2D_0$-neighbourhood has size smaller than
$100n/r$ in $P$.

Proof. The proof follows [ARV09]: the algorithm essentially follows the cover composition proof,
maintaining the cover along the induction steps (this is possible because the probabilities we
are dealing with are all larger than some inverse polynomial, and the probabilities do not need to
be estimated exactly). The main differences are:

1. Here we need to boost the probability from $1/(G(r)\,r)$ to $1 - 1/r$, which is why $D_2 = \Omega(\Delta/\log r)$ (in
[ARV09] one can find a pair that is a constant distance apart in the $\ell_2$ metric).

2. A non-expanding set is now defined as a set that does not expand to $100n/r$ vertices within
graph distance $2D_0$. This is fine because either there is a pair within graph distance $2D_0$ and
$\ell_2$ distance more than $D_2$, in which case the lemma is true; or all vertices in this neighbouring
set are also close in $\ell_2$ distance, which then matches the definition of a non-expanding set in
[ARV09]. □

B.2.8 Final Proof 

The following Lemma immediately implies Lemma B.4. 

Lemma B.19. Given a dual solution with degree $d$ and expansion $\beta < C_\beta \log^{-1} r$ (where $C_\beta$ is a
universal constant), there is an algorithm that finds a set of size at most $100n/r$ with expansion
$O(d\sqrt{\log n}/\sqrt{\log r})$.

Proof. First apply Lemma B.6. If Lemma B.6 does not find a set, then we have sets $P, Q, R$ from
Lemma B.6 and a partition oracle whose current set is $P \cup Q \cup R$.

Now we repeatedly apply Construct Cover; each round yields an obstacle set of type
I or II, as follows.

If Construct Cover fails with probability more than $1/(n^2 G(r))$, then we get an obstacle set from Lemma B.16.
Otherwise we have a large uniform matching cover by Lemma B.17. We then apply
Lemma B.18 to this uniform matching cover. Under the assumption $D_1 < D_2/10$ (see the footnote), since $W \subseteq P$
the first case of Lemma B.18 cannot happen, so we must get a non-expanding set. Lemma B.16 also
applies to this non-expanding set, and we again get an obstacle set of type I or II.

Once we get the obstacle set, we feed it into the partition oracle and recurse on the current
set of the oracle. We always call obstacle sets of type I $S$ and obstacle sets of type II $T$. The
corresponding sets returned by the oracle are called $S'$ and $T'$, respectively.

At the end one of two cases happens: either the sets corresponding to $S'$ take up
more than $n/(10\Delta\log 30r)$ vertices, or the sets corresponding to $T'$ take up more than $n/(10\Delta\log 30r)$
vertices.

In the first case, Lemma B.11 shows that one of the $S'$ must have low expansion.

The second case contradicts the feasibility of the dual solution because of Lemma B.12. □

B.3 Getting SSE Flows and Spectral SSE Flows 

Thus far our existence proof dealt with weak SSE flows. 

B.3.1 Getting to SSE flows 

Lemma B.20. If $G = (V,E)$ is a graph and an $(r,d,\beta)$ weak SSE flow is embeddable in $G$, then either
there is a set of size at most $n/r$ that has expansion less than $d\beta$, or there exists an $(r, d, \beta/6)$ SSE
flow embeddable in $G$.

Proof. Let $F$ be the weak SSE flow. If every set $S$ of size $|S| < n/3r$ has expansion at
least $\beta$ in $F$, then $F$ is already an SSE flow.

When there exists a set $S$ of size smaller than $n/3r$ whose expansion in $F$ is smaller than $\beta$, remove
$S$ (for the remaining vertices, replace edges going to $S$ with self-loops) and repeat this procedure.



[Footnote: The constant in $D_1$ is in fact hidden in the expansion $O(d\sqrt{\log n}/\sqrt{\log r})$, which can be chosen
independently of $D_2$.]

Let $U$ be the union of the removed sets. The size of $U$ cannot be larger than $n/3r$: if after removing
some $S$ the size of $U$ first becomes larger than $n/3r$, then since $S$ has size smaller than $n/3r$, the
size of $U$ must be between $n/3r$ and $2n/3r$. The expansion of $U$ is at most the maximum expansion
among the sets $S$, which is smaller than $\beta$. Such a set cannot exist by the definition of weak SSE flows.

Now add a source and a sink to the graph. Add an edge from the source to every vertex in $U$ with
capacity $d\beta$, add an edge from every vertex in $V \setminus U$ to the sink with capacity $d\beta$, and then try to
route the maximum single-commodity flow from source to sink.

If the maximum flow is smaller than $d\beta|U|$, then there must be a cut of value smaller than
$d\beta|U|$ in the new graph. Let $Q$ be the side of this cut that contains the source; then
$E(Q, V \setminus Q) < d\beta|U| - |Q \oplus U|\,d\beta$ (this is because every $i$ in $Q$ but not in $U$ has an edge of capacity $d\beta$ to the sink, and
every $i$ in $U$ but not in $Q$ has an edge of capacity $d\beta$ from the source), and $|Q| \ge |U| - |Q \oplus U|$. Hence the expansion
of $Q$ is strictly smaller than $d\beta$.

If the maximum flow is $d\beta|U|$, let the single-commodity flow be $F_1$, and let $F_2 = (F + F_1)/2$
(here "+" just takes the linear combination of demands). Clearly $F_2$ is still embeddable into $G$. For
any set $S$ of size at most $n/r$, if more than $|S|/3$ of the vertices are outside $U$, then it already has
$\beta d|S|/6$ outgoing edges outside $U$ in $F/2$; if less than $|S|/3$ of the vertices are outside $U$, then it
has $d\beta|S|/6$ outgoing edges just by the flow $F_1/2$. Therefore $F_2$ is an $(r, d, \beta/6)$ SSE flow. □
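
The source/sink construction in this proof can be sketched in code. The snippet below is an illustration only: it uses Edmonds-Karp (a standard max-flow routine chosen here for simplicity, not one named in the text), represents $G$ by a dictionary of undirected edge capacities, and all names (`max_flow`, `route_from_removed_set`, `d_beta`) are hypothetical. It returns the flow value, whether all source edges are saturated, and the source side $Q$ of a minimum cut.

```python
from collections import deque


def max_flow(capacity, source, sink):
    """Edmonds-Karp on a dict-of-dicts capacity map; returns (value, source_side)."""
    residual = {u: dict(nbrs) for u, nbrs in capacity.items()}
    for u, nbrs in capacity.items():
        for v in nbrs:
            residual.setdefault(v, {}).setdefault(u, 0.0)  # reverse arcs
    value = 0.0
    while True:
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v, cap in residual[u].items():
                if cap > 1e-12 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            # No augmenting path: vertices reached form the source side of a min cut.
            return value, set(parent)
        bottleneck = float("inf")
        v = sink
        while parent[v] is not None:
            bottleneck = min(bottleneck, residual[parent[v]][v])
            v = parent[v]
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        value += bottleneck


def route_from_removed_set(edges, vertices, U, d_beta):
    """Build the auxiliary network of the proof and test whether d*beta*|U| is routable."""
    capacity = {v: {} for v in vertices}
    capacity["source"] = {}
    capacity["sink"] = {}
    for (u, v), c in edges.items():  # undirected capacities of G
        capacity[u][v] = capacity[u].get(v, 0.0) + c
        capacity[v][u] = capacity[v].get(u, 0.0) + c
    for v in vertices:
        if v in U:
            capacity["source"][v] = d_beta   # source -> U
        else:
            capacity[v]["sink"] = d_beta     # V \ U -> sink
    value, reachable = max_flow(capacity, "source", "sink")
    saturated = value >= d_beta * len(U) - 1e-9
    return value, saturated, reachable - {"source"}
```

When `saturated` is false, the returned set plays the role of $Q$ in the argument above; when it is true, the flow corresponds to $F_1$.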

Unfortunately, this Lemma is only existential. In general, even if we are given a SSE flow, it is 
hard to verify it exactly. 

B.3.2 Getting Spectral SSE flow 

We can use higher-order analogues of Cheeger's inequality to establish a relation between SSE
flows and spectral SSE flows:

Theorem B.21 ([LRTV12, LGT12]). For any graph $G$, $\Phi_r \le O(\sqrt{\lambda_{2r}(\mathcal{L})\log r})$. Here $\lambda_{2r}(\mathcal{L})$ is
the $2r$-th smallest eigenvalue of the normalized Laplacian of $G$.

This implies that if the largest and smallest degrees are close, then an SSE flow is already a
spectral SSE flow.

Lemma B.22. For any graph $G = (V,E)$, if there is an $(r,d,\beta)$ SSE flow embeddable in $G$, then
there is a $(2r, d, \Omega(\beta^2/\log r))$ spectral SSE flow embeddable in $G$.

Before proving Lemma B.22, we will need the following simple claim so as to relate the eigen- 
values of normalized Laplacian matrix to the original Laplacian. 

Claim B.23. Let $d_{\min}, d_{\max}$ be the minimum and maximum degrees in $G$, respectively. Then:

$$\frac{1}{d_{\max}} L(G) \preceq \mathcal{L}(G) \preceq \frac{1}{d_{\min}} L(G).$$

Proof. For any pair of nodes $u, v$, $d_{\min} \le \sqrt{d_u d_v} \le d_{\max}$. Hence for any $x \in \mathbb{R}^V$:

$$\frac{1}{d_{\max}} x^T L x = \frac{1}{d_{\max}} \sum_{(u,v) \in E} (x_u - x_v)^2 \le \sum_{(u,v) \in E} \frac{(x_u - x_v)^2}{\sqrt{d_u d_v}} \le \frac{1}{d_{\min}} x^T L x.$$

Proof of Lemma B.22. Let $F$ be the $(r,d,\beta)$ SSE flow, and let $F_1$ be a flow whose demand between $i$ and $j$ is
$c_{ij}$. Clearly $F_1$ is embeddable in $G$ and has degree 1. Let $F_2 = F/2 + dF_1/2$; then the degrees of
vertices in $F_2$ are between $d/2$ and $d$.

By the definition of SSE flow, $\Phi_r(F_2) \ge \beta/2$. Let $\mathcal{L}$ be the normalized Laplacian of $F_2$
and $L$ its Laplacian; then by Theorem B.21, $\lambda_{2r}(\mathcal{L}) \ge \Omega(\beta^2/\log r)$.

Since the degrees of $F_2$ are all between $d/2$ and $d$, by Claim B.23 the eigenvalues of its normalized
Laplacian are closely related to those of its Laplacian: $\lambda_{2r}(L) \ge \frac{d}{2}\lambda_{2r}(\mathcal{L}) = \Omega(d\beta^2/\log r)$. □
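
The degree bound on $F_2$ used in this proof can be checked in one line. The sketch below assumes, as the definition of an $(r,d,\beta)$ SSE flow appears to require, that every vertex has degree between $0$ and $d$ in $F$; the notation $\deg_F(v)$ is introduced here for illustration.

```latex
% Assumption: 0 \le \deg_F(v) \le d for every vertex v, and \deg_{F_1}(v) = 1.
\deg_{F_2}(v)
  = \tfrac{1}{2}\deg_F(v) + \tfrac{d}{2}\deg_{F_1}(v)
  = \tfrac{1}{2}\deg_F(v) + \tfrac{d}{2}
  \in \left[\tfrac{d}{2},\, d\right].
```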

The converse direction (spectral flows imply combinatorial flows) is also true, except that the
combinatorial expansion must be defined on $r$ disjoint sets instead of one set.

Lemma B.24 ([KLL+13]). An $(r,d,\lambda)$ spectral flow satisfies the following combinatorial expansion
property: for any $r$ disjoint sets $S_1, S_2, \ldots, S_r$, the maximum of the expansions of these sets is at
least $\lambda/2$.

Proof. This proof comes from [KLL+13]; we restate it here for completeness. We use the
Courant-Fischer-Weyl variational characterization of the $r$-th smallest eigenvalue:

$$\lambda_r(L) = \min_{\text{subspace } P \text{ of dimension } r}\ \max_{h \in P} \frac{h^T L h}{h^T h}.$$

Let the subspace $P$ be the span of the indicator vectors of the $S_i$'s. For any $h = \sum_{i=1}^r \alpha_i 1_{S_i}$ and all
$u, v \in V$,

$$(h(u) - h(v))^2 \le 2\sum_{i=1}^r \alpha_i^2 \left(1_{S_i}(u) - 1_{S_i}(v)\right)^2.$$

So the Rayleigh quotient of $h$ is at most

$$R(h) = \frac{h^T L h}{\|h\|_2^2} \le \frac{2\sum_{i=1}^r \alpha_i^2\, 1_{S_i}^T L\, 1_{S_i}}{\sum_{i=1}^r \alpha_i^2 |S_i|} \le 2\max_i R(1_{S_i}).$$

We know $\max_{h \in P} R(h) \ge \lambda$, so the maximum expansion must be at least $\lambda/2$. □



To find spectral SSE flows, the following algorithm uses a convex program:

Lemma B.25. If there exists an $(r, d, \lambda)$ spectral SSE flow embeddable in $G$, there is an efficient
algorithm that finds a $(2r, d, \lambda/2)$ spectral SSE flow.

Proof. The algorithm solves the following optimization problem:

$$\begin{aligned}
\max\ & \sum_{i=1}^{2r} \lambda_i(L(F)) \\
\text{s.t.}\ & \forall i \in V:\ \frac{d}{2} \le \sum_{j} \sum_{p \in \mathcal{P}_{ij}} f_p \le d, \\
& F \text{ embeddable in } G.
\end{aligned}$$

Here $L(F)$ is the Laplacian of the flow. The first constraint just says that the degree of every vertex
should be between $d/2$ and $d$. This is a convex program because the entries of $L(F)$ are linear functions
of the $f_p$, and the sum of the $2r$ smallest eigenvalues of a matrix is a concave function. The convex program
can be solved in polynomial time (see the footnote).



[Footnote: There are exponentially many paths, but there is a canonical way of reducing the number of variables for flows.]

Clearly the $(r, d, \lambda)$ spectral SSE flow is a feasible solution and has objective value at least $rd\lambda$.
Hence the solution of this convex program must have objective value at least $rd\lambda$, which means that
the $2r$-th eigenvalue of $L(F)$ is at least $\frac{rd\lambda}{2r} = d\lambda/2$. □
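
The two steps of this argument can be spelled out as a short derivation. It assumes that an $(r,d,\lambda)$ spectral SSE flow $F^*$ means $\lambda_r(L(F^*)) \ge d\lambda$ (the reading suggested by the proof of Lemma B.22) and that eigenvalues are listed in increasing order; $F^*$ is notation introduced here for the assumed flow, and $F$ is the optimum of the convex program.

```latex
% Assumptions: \lambda_1 \le \lambda_2 \le \dots, all \lambda_i \ge 0 (Laplacians are PSD),
% and \lambda_r(L(F^*)) \ge d\lambda for the assumed (r, d, \lambda) spectral SSE flow F^*.
\sum_{i=1}^{2r} \lambda_i(L(F^*))
  \;\ge\; \sum_{i=r}^{2r} \lambda_i(L(F^*))
  \;\ge\; (r+1)\, d\lambda \;\ge\; r d\lambda,
\qquad
\lambda_{2r}(L(F))
  \;\ge\; \frac{1}{2r}\sum_{i=1}^{2r} \lambda_i(L(F))
  \;\ge\; \frac{r d\lambda}{2r} = \frac{d\lambda}{2}.
```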

B.4 Finding a Small Nonexpanding Set when Eigenspace Enumeration Fails 

Combining Theorems 5.4 and 5.6, we know if ^locai for a graph is at least 0{^ global V^'^S ^ log^'^ i"/^)^ 
there is an eigenspace enumeration algorithm that finds a (1 + e) approximation of sparsest cut. 
Here we show when the algorithm fails, how to find a small set that does not expand in polynomial 
time. 

Lemma B.26. Given a graph $G$, for any $d, r$, there is a polynomial time algorithm that either finds
a $(2r, d, \lambda = \Omega((\log r)^{-3}))$ spectral flow, or finds a set of size at most $100n/r$ that has expansion at
most $O(d\sqrt{\log n}/\sqrt{\log r})$.

Proof. By Lemmas B.20 and B.22, a weak SSE flow implies a spectral SSE flow unless
there is a small set with very small expansion. Therefore if the algorithm in Lemma B.25 does not
succeed, either there is a set of size at most $n/3r$ that has expansion $d\beta$ where $\beta = O((\log r)^{-1})$, or
there is no weak SSE flow.

In the first case we can simply run the approximation algorithm for small set expansion in
[BFK+11], which gives a $\sqrt{\log n \log r}$ approximation; the set we get will be small and has expansion
at most $d\beta\sqrt{\log n \log r} \le O(d\sqrt{\log n/\log r})$.

In the second case, there is no weak SSE flow, so the LP for the weak SSE flow must be
infeasible and its dual must be feasible. The original dual formulation has exponentially many
variables; however, in Appendix B.2.1 we mapped the solution to a concise representation using
vectors $Z_i$. Equations (15) to (20) are almost constraints of a semidefinite program, except for
eq. (18). However, we can write the spreading constraint in a more tractable way:

$$\forall i \in V: \quad \sum_{j \in V} \min\{0.9\|Z_i\|_2^2,\ \|Z_j - Z_i\|_2^2\} \ge 0.9\|Z_i\|_2^2\, n - \|Z_i\|_2^2 \cdot \frac{n}{r}.$$

This inequality is clearly satisfied by the $Z_i$'s converted from the original dual solution: there,
the number of vectors within distance $0.9\|Z_i\|_2^2$ of $Z_i$ is at most $10n/9r$, so even if all of them are identical to $Z_i$,
the sum on the left-hand side can only be $\|Z_i\|_2^2 \cdot n/r$ away from its maximum possible value $0.9\|Z_i\|_2^2\, n$.

On the other hand, if this inequality is satisfied, then for any $i$ the number of $j$ such that
$\|Z_j - Z_i\|_2^2 < 0.8\|Z_i\|_2^2$ is at most $10n/r$. This is very similar to Constraint (18) except that the constants
are larger. This increase in the constants does not change anything in the proof of Lemma B.19.

Therefore, we can solve the SDP to get a concise representation of the dual solution, and then
apply Lemma B.19 to find a set of size at most $100n/r$ with expansion $O(d\sqrt{\log n}/\sqrt{\log r})$. □
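
The relaxed spreading constraint is easy to evaluate for a concrete set of vectors. The following is a small sketch under the assumption that the vectors $Z_i$ are given as plain lists of floats; the function names (`spreading_slack`, `close_count`) are hypothetical and only illustrate how the two quantities discussed in the proof relate.

```python
def squared_norm(v):
    """Squared Euclidean norm of a vector."""
    return sum(x * x for x in v)


def squared_distance(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def spreading_slack(Z, r):
    """For each i, return LHS - RHS of the relaxed spreading constraint.

    LHS = sum_j min(0.9 |Z_i|^2, |Z_j - Z_i|^2)
    RHS = 0.9 |Z_i|^2 n - |Z_i|^2 n / r
    The constraint holds at i exactly when the returned slack is >= 0.
    """
    n = len(Z)
    slack = []
    for zi in Z:
        norm_i = squared_norm(zi)
        lhs = sum(min(0.9 * norm_i, squared_distance(zj, zi)) for zj in Z)
        rhs = 0.9 * norm_i * n - norm_i * n / r
        slack.append(lhs - rhs)
    return slack


def close_count(Z, i, factor=0.8):
    """Number of j (including j = i) with |Z_j - Z_i|^2 < factor * |Z_i|^2."""
    norm_i = squared_norm(Z[i])
    return sum(1 for zj in Z if squared_distance(zj, Z[i]) < factor * norm_i)
```

For instance, pairwise orthogonal unit vectors are well spread and give positive slack, while identical copies of one vector violate the constraint; each close vector costs more than $0.1\|Z_i\|_2^2$ of slack, which is where the $10n/r$ bound comes from.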

C Planted Expander Model 

Our algorithm naturally applies to the planted expander model. In this model the graph has a
planted bisection of expansion $\Phi_{\text{planted}}$, and the smaller side of the bisection has size $pn$. The induced
subgraph on each side of the partition is an expander with expansion $\Phi_{\text{global}} \ge \Phi_{\text{planted}}\sqrt{\log n \log(1/p)}$.
In this case we can show that the assumptions of Theorem 3.1 hold, and the algorithm gives a good
approximation to sparsest cut.

This result is similar to the "planted spectral expander model" in [MMV12]. The main difference
is that they assume the induced graphs of the partition have algebraic expansion a constant times more
than $\Phi_{\text{planted}}$. Notice that our combinatorial expansion property only implies algebraic expansion of
$\Phi_{\text{planted}}^2 \log n \log(1/p)$, which might be smaller than $\Phi_{\text{planted}}$ if the planted bisection is sparse enough.
Unfortunately our result only applies to regular graphs, therefore a direct comparison is not possible.
Our formal guarantee is given in the following theorem.

Theorem C.1. Assume graph $G = (V,E)$ is a regular graph with an unknown planted bisection
$(S, V \setminus S)$. The size of $S$ is $pn$ ($p < 1/2$) with $\Phi(S) = \Phi_{\text{planted}}$. If the induced subgraphs of $S$
and V\S both have expansion $ ^ t^<1>(S')4 /lognlog — , then the algorithm in Theorem 3.1 with 
$r = O(1/p)$ gives a $(1 + \epsilon)$ approximation to sparsest cut.

Proof. We only need to show the assumptions in Theorem 3.1 are satisfied: Sets of size pn/2 should 
have sparsity at least ^{(f>sparsest\/^ognlog ^e~^'^). Since sparsity and expansion are within a 



sparsesty ^"-"6 "' i^Jfe p^ 



constant factor, we will show the expansion of small sets are at least A = 0(<1>(S') i/log n log —/e^'^ 
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For any set T of size at most pn/2, let Ti he T D S and T2 be T n {V\S). By assumption we 
know E{Ti, S\Ti) > jTil • A and E{T2, V\{S U Ts)) > \T2\A. Hence 

^ E{T,V\T) ^ E{Ti,S\n) + EiT2,V\iSUT2)) 

\ } \rp\ — \rp I I \rp I ^^ 
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